LIM Mineralization Protein Splice Variants

BACKGROUND OF THE INVENTION 

1. Field of the Invention

The field of the invention relates generally to osteogenic cells and the 
formation of bone and boney tissue in mammalian species. Specifically, the 
invention concerns a novel family of proteins, and nucleic acids encoding those 
proteins, that enhances the efficacy of bone mineralization in vitro and in vivo. 
The invention provides methods for treating a variety of pathological conditions
associated with bone and boney tissue, such as, for example, spine fusion, 
fracture repair and osteoporosis. 

2. Description of the Related Art 

Osteoblasts are thought to differentiate from pluripotent mesenchymal
stem cells. The maturation of an osteoblast results in the secretion of an
extracellular matrix which can mineralize and form bone. The regulation of this 
complex process is not well understood but is thought to involve a group of 
signaling glycoproteins known as bone morphogenetic proteins (BMPs). These
proteins have been shown to be involved with embryonic dorsal-ventral
patterning, limb bud development, and fracture repair in adult animals. B. L.
Hogan, Genes & Develop. 10:1580 (1996). This group of transforming growth
factor-beta superfamily secreted proteins has a spectrum of activities in a
variety of cell types at different stages of differentiation; differences in
physiological activity between these closely related molecules have not been
clarified. D. M. Kingsley, Trends Genet. 10:16 (1994).

To better discern the unique physiological role of different BMP signaling 
proteins, we recently compared the potency of BMP-6 with that of BMP-2 and 
BMP-4 for inducing rat calvarial osteoblast differentiation. Boden et al.,
Endocrinology 137:3401 (1996). We studied this process in first passage
(secondary) cultures of fetal rat calvaria that require BMP or glucocorticoid for
initiation of differentiation. In this model of membranous bone formation,
glucocorticoid (GC) or a BMP will initiate differentiation to mineralized bone 
nodules capable of secreting osteocalcin, the osteoblast-specific protein. This 
secondary culture system is distinct from primary rat osteoblast cultures, which
undergo spontaneous differentiation. In this secondary system, glucocorticoid
resulted in a ten-fold induction of BMP-6 mRNA and protein expression, which
was responsible for the enhancement of osteoblast differentiation. Boden et
al., Endocrinology 138:2920 (1997).

In addition to extracellular signals, such as the BMPs, intracellular
signals or regulatory molecules may also play a role in the cascade of events
leading to formation of new bone. One broad class of intracellular regulatory
molecules is the LIM proteins, which are so named because they possess a
characteristic structural motif known as the LIM domain. The LIM domain is a
cysteine-rich structural motif composed of two special zinc fingers that are
joined by a 2-amino acid spacer. Some proteins have only LIM domains, while
others contain a variety of additional functional domains. LIM proteins form a
diverse group, which includes transcription factors and cytoskeletal proteins.
The primary role of LIM domains appears to be in mediating protein-protein
interactions, through the formation of dimers with identical or different LIM
domains, or by binding distinct proteins.

In LIM homeodomain proteins, that is, proteins having both LIM domains 
and a homeodomain sequence, the LIM domains function as negative 
regulatory elements. LIM homeodomain proteins are involved in the control of 
cell lineage determination and the regulation of differentiation, although
LIM-only proteins may have similar roles. LIM-only proteins are also implicated in
the control of cell proliferation since several genes encoding such proteins are 
associated with oncogenic chromosome translocations. 

Humans and other mammalian species are prone to diseases or injuries 
that require the processes of bone repair and/or regeneration. For example,
treatment of fractures would be improved by new treatment regimens that
could stimulate the natural bone repair mechanisms, thereby reducing the time
required for the fractured bone to heal. In another example, individuals
afflicted with systemic bone disorders, such as osteoporosis, would benefit
from treatment regimens that would result in systemic formation of new bone.
Such treatment regimens would reduce the incidence of fractures arising from
the loss of bone mass that is a characteristic of this disease.

For at least these reasons, extracellular factors, such as the BMPs, 
have been investigated for the purpose of using them to stimulate formation of 
new bone in vivo. Despite the early successes achieved with BMPs and other 
extracellular signalling molecules, their use entails a number of disadvantages.
For example, relatively large doses of purified BMPs are required to enhance
the production of new bone, thereby increasing the expense of such treatment
methods. Furthermore, extracellular proteins are susceptible to degradation
following their introduction into a host animal. In addition, because they are
typically immunogenic, the possibility of stimulating an immune response to the
administered proteins is ever present.

Due to such concerns, it would be desirable to have available treatment 
regimens that use an intracellular signalling molecule to induce new bone 
formation. Advances in the field of gene therapy now make it possible to 
introduce into osteogenic precursor cells, that is, cells involved in bone
formation, or peripheral blood leukocytes, nucleotide fragments encoding
intracellular signals that form part of the bone formation process. Gene
therapy for bone formation offers a number of potential advantages: (1) lower
production costs; (2) greater efficacy, compared to extracellular treatment
regimens, due to the ability to achieve prolonged expression of the
intracellular signal; (3) it would bypass the possibility that treatment with
extracellular signals might be hampered due to the presence of limiting
numbers of receptors for those signals; (4) it permits the delivery of transfected
potential osteoprogenitor cells directly to the site where localized bone
formation is required; and (5) it would permit systemic bone formation, thereby
providing a treatment regimen for osteoporosis and other metabolic bone
diseases.

SUMMARY OF THE INVENTION

The present invention seeks to overcome the drawbacks in the prior art
by providing novel compositions and methods for inducing bone formation
using an intracellular signalling molecule that participates early in the cascade
of events that leads to bone formation. Applicants have discovered
10-4/RLMP (SEQ ID NO: 1, SEQ ID NO: 2), a novel LIM gene with a sequence
originally isolated from stimulated rat calvarial osteoblast cultures. The gene
has been cloned, sequenced and assayed for its ability to enhance the efficacy
of bone mineralization in vitro. The protein RLMP affects mineralization of
bone matrix as well as differentiation of cells into the osteoblast lineage. Unlike
other known cytokines, for example, BMPs, RLMP is not a secreted protein,
but is instead an intracellular signaling molecule. This feature has the
advantage of providing intracellular signaling amplification as well as easier
assessment of transfected cells. It is also suitable for more efficient and
specific in vivo applications. Suitable clinical applications include enhancement
of bone repair in fractures, bone defects, bone grafting, and normal
homeostasis in patients presenting with osteoporosis.

Applicants have also cloned, sequenced and deduced the amino acid
sequence of a corresponding human protein, named human LMP-1. The
human protein demonstrates enhanced efficacy of bone mineralization in vitro
and in vivo.

In addition, the applicants have characterized a truncated (short) version
of LMP-1, termed HLMP-1s. This short version resulted from a point mutation
in one source of a cDNA clone, providing a stop codon which truncated the
protein. The short version (LMP-1s) is fully functional when expressed in cell
culture and in vivo.

Using PCR analysis of a human heart cDNA library, Applicants have
identified two alternative splice variants (referred to as HLMP-2 and HLMP-3)
that differ from HLMP-1 in a region between base pairs 325 and 444 in the
nucleotide sequence encoding HLMP-1. The HLMP-2 sequence has a 119
base pair deletion and an insertion of 17 base pairs in this region. Compared
to HLMP-1, the nucleotide sequence encoding HLMP-3 has no deletions, but it
does have the same 17 base pairs as HLMP-2, which are inserted at position
444 in the HLMP-1 sequence.

Additional features and advantages of the invention will be set forth in 
the description which follows, and in part will be apparent from the description, 
or may be learned by practice of the invention. The objectives and other 
advantages of the invention will be realized and attained by the methods and 
compositions of matter particularly pointed out in the written description and
claims hereof. 

In one broad aspect, the invention relates to an isolated nucleic acid
molecule comprising a nucleic acid sequence encoding any LIM mineralization
protein, wherein the nucleic acid molecule hybridizes under standard
conditions to a nucleic acid molecule complementary to the full length of SEQ
ID NO: 25, and wherein the molecule hybridizes under highly stringent
conditions to a nucleic acid molecule complementary to the full length of SEQ
ID NO: 26. In a specific aspect, the isolated nucleic acid molecule encodes
HLMP-1, HLMP-1s, RLMP, HLMP-2, or HLMP-3. In addition, the invention is
directed to vectors comprising these nucleic acid molecules, as well as host
cells comprising the vectors. In another specific aspect, the invention relates to
the proteins themselves.

In a second broad aspect, the invention relates to an antibody that is
specific for LIM mineralization protein, including HLMP-1, HLMP-1s, RLMP,
HLMP-2, and HLMP-3. In one specific aspect, the antibody is a polyclonal
antibody. In another specific aspect, the antibody is a monoclonal antibody.

In a third broad aspect, the invention relates to a method of inducing bone
formation wherein osteogenic precursor cells are transfected with an isolated
nucleic acid molecule comprising a nucleotide sequence encoding LIM
mineralization protein. In one specific aspect, the isolated nucleic acid
molecule is in a vector, which may be a plasmid or a virus, such as adenovirus
or retrovirus. The transfection may occur ex vivo or in vivo by direct injection of
the isolated nucleic acid molecule. The transfected isolated nucleic acid
molecule may encode HLMP-1, HLMP-1s, RLMP, HLMP-2, or HLMP-3.

In a further aspect, the invention relates to methods of fusing a spine by
transfecting osteogenic precursor cells with an isolated nucleic acid molecule
having a nucleotide sequence encoding LIM mineralization protein, admixing
the transfected osteogenic precursor cells with a matrix and contacting the
matrix with the spine.

In yet another aspect, the invention relates to methods for inducing
systemic bone formation by stable transfection of host cells with the vectors of
the invention.

It is to be understood that both the foregoing general description and the
following detailed description are exemplary and explanatory and are intended
to provide further explanation of the invention as claimed.

ABBREVIATIONS AND DEFINITIONS 



BMP        Bone Morphogenetic Protein
HLMP-1     Human LMP-1, also designated as Human LIM Protein or HLMP
HLMP-1s    Human LMP-1 Short (truncated) protein
HLMPU      Human LIM Protein Unique Region
LMP        LIM mineralization protein
MEM        Minimal essential medium
Trm        Triamcinolone
β-GlyP     Beta-glycerolphosphate
RACE       Rapid Amplification of cDNA Ends
RLMP       Rat LIM mineralization protein, also designated as RLMP-1
RLMPU      Rat LIM Protein Unique Region
RNAsin     RNase inhibitor
ROB        Rat Osteoblast
10-4       Clone containing cDNA sequence for RLMP (SEQ ID NO: 2)
UTR        Untranslated Region
HLMP-2     Human LMP Splice Variant 2
HLMP-3     Human LMP Splice Variant 3



DETAILED DESCRIPTION OF THE INVENTION 



The present invention relates to novel mammalian LIM proteins, herein
designated LIM mineralization proteins, or LMP. The invention relates more
particularly to human LMP, known as HLMP or HLMP-1, or alternative splice
variants of human LMP, which are known as HLMP-2 or HLMP-3. The
applicants have discovered that these proteins enhance bone mineralization in
mammalian cells grown in vitro. When produced in mammals, LMP also
induces bone formation in vivo.

Ex vivo transfection of bone marrow cells, osteogenic precursor cells,
peripheral blood leukocytes, or mesenchymal stem cells with nucleic acid that
encodes LMP or HLMP, followed by reimplantation of the transfected cells in
the donor, is suitable for treating a variety of bone-related disorders or injuries.
For example, one can use this method to: augment long bone fracture repair;
generate bone in segmental defects; provide a bone graft substitute for
fractures; facilitate tumor reconstruction or spine fusion; and provide a local
treatment (by injection) for weak or osteoporotic bone, such as in osteoporosis
of the hip, vertebrae, or wrist. Transfection with LMP or HLMP-encoding
nucleic acid is also useful in: the percutaneous injection of transfected marrow
cells to accelerate the repair of fractured long bones; treatment of delayed
union or non-unions of long bone fractures or pseudoarthrosis of spine fusions;
and for inducing new bone formation in avascular necrosis of the hip or knee.

In addition to ex vivo-based methods of gene therapy, transfection of a
recombinant DNA vector comprising a nucleic acid sequence that encodes
LMP or HLMP can be accomplished in vivo. When a DNA fragment that
encodes LMP or HLMP is inserted into an appropriate viral vector, for example,
an adenovirus vector, the viral construct can be injected directly into a body
site where endochondral bone formation is desired. By using a direct,
percutaneous injection to introduce the LMP or HLMP sequence, stimulation of
bone formation can be accomplished without the need for surgical intervention
either to obtain bone marrow cells (to transfect ex vivo) or to reimplant them
into the patient at the site where new bone is required. Alden et al.,
Neurosurgical Focus (1998), have demonstrated the utility of a direct injection
method of gene therapy using a cDNA that encodes BMP-2, which was cloned
into an adenovirus vector.

It is also possible to carry out in vivo gene therapy by directly injecting
into an appropriate body site a naked, that is, unencapsulated, recombinant
plasmid comprising a nucleic acid sequence that encodes HLMP. In this
embodiment of the invention, transfection occurs when the naked plasmid DNA
is taken up, or internalized, by the appropriate target cells, which have been
described. As in the case of in vivo gene therapy using a viral construct, direct
injection of naked plasmid DNA offers the advantage that little or no surgical
intervention is required. Direct gene therapy, using naked plasmid DNA that
encodes the endothelial cell mitogen VEGF (vascular endothelial growth
factor), has been successfully demonstrated in human patients. Baumgartner
et al., Circulation 97(12):1114-23 (1998).

By using an adenovirus vector to deliver LMP into osteogenic cells,
transient expression of LMP is achieved. This occurs because adenovirus
does not incorporate into the genome of target cells that are transfected.
Transient expression of LMP, that is, expression that occurs during the lifetime
of the transfected target cells, is sufficient to achieve the objects of the
invention. Stable expression of LMP, however, can occur when a vector that
incorporates into the genome of the target cell is used as a delivery vehicle.
Retrovirus-based vectors, for example, are suitable for this purpose.

Stable expression of LMP is particularly useful for treating various
systemic bone-related disorders, such as osteoporosis and osteogenesis
imperfecta. For this embodiment of the invention, in addition to using a vector
that integrates into the genome of the target cell to deliver an LMP-encoding
nucleotide sequence into target cells, LMP expression is placed under the
control of a regulatable promoter. For example, a promoter that is turned on by
exposure to an exogenous inducing agent, such as tetracycline, is suitable.
Using this approach, one can stimulate formation of new bone on a systemic
basis by administering an effective amount of the exogenous inducing agent.
Once a sufficient quantity of bone mass is achieved, administration of the
exogenous inducing agent is discontinued. This process may be repeated as
needed to replace bone mass lost, for example, as a consequence of
osteoporosis.

Antibodies specific for HLMP are particularly suitable for use in methods
for assaying the osteoinductive, that is, bone-forming, potential of patient cells.
In this way one can identify patients at risk for slow or poor healing of bone
repair. Also, HLMP-specific antibodies are suitable for use in marker assays to
identify risk factors in bone degenerative diseases, such as, for example,
osteoporosis.

Following well known and conventional methods, the genes of the
present invention are prepared by ligation of nucleic acid segments that
encode LMP to other nucleic acid sequences, such as cloning and/or
expression vectors. Methods needed to construct and analyze these
recombinant vectors, for example, restriction endonuclease digests, cloning
protocols, mutagenesis, organic synthesis of oligonucleotides and DNA
sequencing, have been described. For DNA sequencing, the
dideoxy terminator method is preferred.

Many treatises on recombinant DNA methods have been published,
including Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor Press, 2nd edition (1988), Davis et al., Basic Methods in
Molecular Biology, Elsevier (1986), and Ausubel et al., Current Protocols in
Molecular Biology, Wiley Interscience (1988). These reference manuals are
specifically incorporated by reference herein.

Primer-directed amplification of DNA or cDNA is a common step in the
expression of the genes of this invention. It is typically performed by the
polymerase chain reaction (PCR). PCR is described in U.S. Patent No.
4,800,159 to Mullis et al. and other published sources. The basic principle of
PCR is the exponential replication of a DNA sequence by successive cycles of
primer extension. The extension products of one primer, when hybridized to
another primer, become a template for the synthesis of another nucleic acid
molecule. The primer-template complexes act as substrate for DNA
polymerase, which, in performing its replication function, extends the primers.
The conventional enzyme for PCR applications is the thermostable DNA
polymerase isolated from Thermus aquaticus, or Taq DNA polymerase.
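
The exponential character of PCR described above can be illustrated with a short calculation. The following is a minimal sketch only, not part of the described methods: the function name and the per-cycle efficiency parameter are illustrative assumptions, and an efficiency of 1.0 corresponds to ideal doubling at every cycle.

```python
def pcr_copies(initial_copies: float, cycles: int, efficiency: float = 1.0) -> float:
    """Expected number of template copies after a given number of PCR cycles.

    efficiency is the fraction of templates extended in each cycle
    (illustrative parameter; 1.0 means every template is doubled).
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    # Each cycle multiplies the copy number by (1 + efficiency).
    return initial_copies * (1.0 + efficiency) ** cycles


# Ideal amplification from a single template molecule.
for n in (10, 20, 30):
    print(n, pcr_copies(1, n))
```

Under the ideal assumption, thirty cycles yield about one billion copies per starting molecule; real reactions typically fall short of this as reagents are consumed.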

Numerous variations of the basic PCR method exist, and a particular
procedure of choice in any given step needed to construct the recombinant
vectors of this invention is readily performed by a skilled artisan. For example,
to measure cellular expression of 10-4/RLMP, RNA is extracted and reverse
transcribed under standard and well known procedures. The resulting cDNA is
then analyzed for the appropriate mRNA sequence by PCR.

The gene encoding the LIM mineralization protein is expressed in an
expression vector in a recombinant expression system. Of course, the
constructed sequence need not be the same as the original, or its
complementary sequence, but instead may be any sequence determined by the
degeneracy of the DNA code that nonetheless expresses an LMP having bone
forming activity. Conservative amino acid substitutions, or other modifications,
such as the occurrence of an amino-terminal methionine residue, may also be
employed.
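
The degeneracy referred to above can be sketched as follows. This is an illustrative example only: the two short DNA sequences are hypothetical and are not taken from any sequence of the invention. The code uses the standard genetic code and shows that sequences differing at several nucleotide positions can encode the same peptide.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for the first, second and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# Hypothetical sequences that differ in three of their four codons.
seq_a = "ATGGATTCTTTC"
seq_b = "ATGGACAGCTTT"

print(translate(seq_a), translate(seq_b))
```

Both hypothetical sequences translate to the same four-residue peptide, which is the sense in which a constructed sequence may differ from the original while still expressing the same protein.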

A ribosome binding site active in the host expression system of choice is
ligated to the 5' end of the chimeric LMP coding sequence, forming a synthetic
gene. The synthetic gene can be inserted into any one of a large variety of
vectors for expression by ligating to an appropriately linearized plasmid. A
regulatable promoter, for example, the E. coli lac promoter, is also suitable for
the expression of the chimeric coding sequences. Other suitable regulatable
promoters include trp, tac, recA, T7 and lambda promoters.

DNA encoding LMP is transfected into recipient cells by one of several
standard published procedures, for example, calcium phosphate precipitation,
DEAE-Dextran, electroporation or protoplast fusion, to form stable
transformants. Calcium phosphate precipitation is preferred, particularly when
performed as follows.

DNAs are coprecipitated with calcium phosphate according to the
method of Graham and Van Der Eb, Virology 52:456 (1973), before transfer into
cells. An aliquot of 40-50 µg of DNA, with salmon sperm or calf thymus DNA as
a carrier, is used for 0.5 x 10^6 cells plated on a 100 mm dish. The DNA is mixed
with 0.5 ml of 2X Hepes solution (280 mM NaCl, 50 mM Hepes and 1.5 mM
Na2HPO4, pH 7.0), to which an equal volume of 2X CaCl2 (250 mM CaCl2 and
10 mM Hepes, pH 7.0) is added. A white granular precipitate, appearing after
30-40 minutes, is evenly distributed dropwise on the cells, which are allowed to
incubate for 4-16 hours at 37°C. The medium is removed and the cells are
shocked with 15% glycerol in PBS for 3 minutes. After removing the glycerol,
the cells are fed with Dulbecco's Minimal Essential Medium (DMEM) containing
10% fetal bovine serum.
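
The final composition of the precipitation mixture follows from mixing equal volumes of the two 2X solutions. The sketch below works through that dilution arithmetic; it assumes that the volume contributed by the DNA itself is negligible, and the function and variable names are illustrative only.

```python
def mix_equal_volumes(solution_a: dict, solution_b: dict) -> dict:
    """Final concentrations (mM) after mixing equal volumes of two solutions.

    Each component is diluted two-fold; a component present in both
    solutions ends up at the average of its two concentrations.
    """
    components = set(solution_a) | set(solution_b)
    return {
        name: (solution_a.get(name, 0.0) + solution_b.get(name, 0.0)) / 2.0
        for name in components
    }


# Concentrations in mM, taken from the recipe in the text.
hepes_2x = {"NaCl": 280.0, "Hepes": 50.0, "Na2HPO4": 1.5}
cacl2_2x = {"CaCl2": 250.0, "Hepes": 10.0}

final_mix = mix_equal_volumes(hepes_2x, cacl2_2x)
for name in sorted(final_mix):
    print(f"{name}: {final_mix[name]} mM")
```

On these assumptions the mixture contains 140 mM NaCl, 30 mM Hepes, 0.75 mM Na2HPO4 and 125 mM CaCl2 at the time the precipitate forms.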

WO 00/66178    PCT/US00/11664

DNA can also be transfected using: the DEAE-Dextran methods of
Kimura et al., Virology 49:394 (1972) and Sompayrac et al., Proc. Natl. Acad.
Sci. USA 78:7575 (1981); the electroporation method of Potter, Proc. Natl.
Acad. Sci. USA 81:7161 (1984); and the protoplast fusion method of
Sandri-Goldin et al., Molec. Cell. Biol. 1:743 (1981).

Phosphoramidite chemistry in solid phase is the preferred method for
the organic synthesis of oligodeoxynucleotides and polydeoxynucleotides. In
addition, many other organic synthesis methods are available. Those methods
are readily adapted by those skilled in the art to the particular sequences of the
invention.

The present invention also includes nucleic acid molecules that
hybridize under standard conditions to any of the nucleic acid sequences
encoding the LIM mineralization proteins of the invention. "Standard
hybridization conditions" will vary with the size of the probe, the background
and the concentration of the nucleic acid reagents, as well as the type of
hybridization, for example, in situ, Southern blot, or hybridization of DNA-RNA
hybrids (Northern blot). The determination of "standard hybridization
conditions" is within the level of skill in the art. For example, see U.S. Patent
5,580,775 to Fremeau et al., herein incorporated by reference for this purpose.
See also, Southern, E. M., J. Mol. Biol. 98:503 (1975), Alwine et al., Meth.
Enzymol. 68:220 (1979), and Sambrook et al., Molecular Cloning: A
Laboratory Manual, 2nd edition, pp. 7.19-7.50, Cold Spring Harbor Press
(1989).

One preferred set of standard hybridization conditions involves a blot that
is prehybridized at 42°C for 2 hours in 50% formamide, 5X SSPE (150 mM
NaCl, 10 mM NaH2PO4 [pH 7.4], 1 mM EDTA [pH 8.0]), 5X Denhardt's solution
(20 mg Ficoll, 20 mg polyvinylpyrrolidone and 20 mg BSA per 100 ml water),
10% dextran sulphate, 1% SDS and 100 µg/ml salmon sperm DNA. A
32P-labelled cDNA probe is added, and hybridization is continued for 14 hours.
Afterward, the blot is washed twice with 2X SSPE, 0.1% SDS for 20 minutes at
22°C, followed by a 1 hour wash at 65°C in 0.1X SSPE, 0.1% SDS. The blot is
then dried and exposed to x-ray film for 5 days in the presence of an
intensifying screen.
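
The salt concentrations implied by the buffer strengths above can be worked out as follows. This is a sketch under stated assumptions: the parenthetical recipe is read as the 1X SSPE composition with NaCl in millimolar units like the other components, and the step labels in the code are descriptive labels added for illustration rather than terms from the text.

```python
# 1X SSPE composition in mM, as read from the recipe in the text
# (assumption: the NaCl figure is millimolar, like the other components).
SSPE_1X_MM = {"NaCl": 150.0, "NaH2PO4": 10.0, "EDTA": 1.0}


def sspe_concentrations(strength: float) -> dict:
    """Component concentrations (mM) for SSPE at a given strength, e.g. 5 for 5X."""
    if strength <= 0:
        raise ValueError("strength must be positive")
    return {name: mm * strength for name, mm in SSPE_1X_MM.items()}


# Steps named in the text: (illustrative label, SSPE strength, temperature in degrees C).
STEPS = [
    ("prehybridization", 5.0, 42),
    ("first washes", 2.0, 22),
    ("final wash", 0.1, 65),
]

for label, strength, temp_c in STEPS:
    conc = sspe_concentrations(strength)
    print(f"{label}: {strength}X SSPE at {temp_c} C, NaCl {conc['NaCl']:.1f} mM")
```

The final wash combines the lowest salt concentration with the highest temperature of the three steps, which is the usual direction of increasing stringency.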

Under "highly stringent conditions," a probe will hybridize to its target
sequence if those two sequences are substantially identical. As in the case of
standard hybridization conditions, one of skill in the art can, given the level of
skill in the art and the nature of the particular experiment, determine the
conditions under which only substantially identical sequences will hybridize.

Another aspect of the invention includes the proteins encoded by the
nucleic acid sequences. In still another embodiment, the invention relates to
the identification of such proteins based on anti-LMP antibodies. In this
embodiment, protein samples are prepared for Western blot analysis by lysing
cells and separating the proteins by SDS-PAGE. The proteins are transferred
to nitrocellulose by electroblotting as described by Ausubel et al., Current
Protocols in Molecular Biology, John Wiley and Sons (1987). After blocking
the filter with instant nonfat dry milk (1 gm in 100 ml PBS), anti-LMP antibody is
added to the filter and incubated for 1 hour at room temperature. The filter is
washed thoroughly with phosphate buffered saline (PBS) and incubated with
horseradish peroxidase (HRPO)-antibody conjugate for 1 hour at room
temperature. The filter is again washed thoroughly with PBS and the antigen
bands are identified by adding diaminobenzidine (DAB).

Monospecific antibodies are the reagent of choice in the present
invention, and are specifically used to analyze patient cells for specific
characteristics associated with the expression of LMP. "Monospecific
antibody" as used herein is defined as a single antibody species or multiple
antibody species with homogeneous binding characteristics for LMP.
"Homogeneous binding" as used herein refers to the ability of the antibody
species to bind to a specific antigen or epitope, such as those associated with
LMP, as described above. Monospecific antibodies to LMP are purified from
mammalian antisera containing antibodies reactive against LMP or are
prepared as monoclonal antibodies reactive with LMP using the technique of
Kohler and Milstein, Nature 256:495-97 (1975). The LMP specific antibodies
are raised by immunizing animals such as, for example, mice, rats, guinea
pigs, rabbits, goats or horses, with an appropriate concentration of LMP either
with or without an immune adjuvant.

In this process, preimmune serum is collected prior to the first
immunization. Each animal receives between about 0.1 mg and about 1000
mg of LMP associated with an acceptable immune adjuvant, if desired. Such
acceptable adjuvants include, but are not limited to, Freund's complete,
Freund's incomplete, alum-precipitate, water in oil emulsion containing
Corynebacterium parvum and tRNA adjuvants. The initial immunization
consists of LMP in, preferably, Freund's complete adjuvant injected at multiple
sites either subcutaneously (SC), intraperitoneally (IP) or both. Each animal is
bled at regular intervals, preferably weekly, to determine antibody titer. The
animals may or may not receive booster injections following the initial
immunization. Those animals receiving booster injections are generally given
an equal amount of the antigen in Freund's incomplete adjuvant by the same
route. Booster injections are given at about three week intervals until maximal
titers are obtained. At about 7 days after each booster immunization or about
weekly after a single immunization, the animals are bled, the serum collected,
and aliquots are stored at about -20°C.

Monoclonal antibodies (mAb) reactive with LMP are prepared by
immunizing inbred mice, preferably Balb/c mice, with LMP. The mice are
immunized by the IP or SC route with about 0.1 mg to about 10 mg, preferably
about 1 mg, of LMP in about 0.5 ml buffer or saline incorporated in an equal
volume of an acceptable adjuvant, as discussed above. Freund's complete
adjuvant is preferred. The mice receive an initial immunization on day 0 and
are rested for about 3-30 weeks. Immunized mice are given one or more
booster immunizations of about 0.1 to about 10 mg of LMP in a buffer solution
such as phosphate buffered saline by the intravenous (IV) route. Lymphocytes
from antibody-positive mice, preferably splenic lymphocytes, are obtained by
removing the spleens from immunized mice by standard procedures known in
the art. Hybridoma cells are produced by mixing the splenic lymphocytes with
an appropriate fusion partner, preferably myeloma cells, under conditions
which will allow the formation of stable hybridomas. Fusion partners may
include, but are not limited to: mouse myelomas P3/NS1/Ag 4-1; MPC-11;
S-194 and Sp 2/0, with Sp 2/0 being preferred. The antibody producing cells and
myeloma cells are fused in polyethylene glycol, about 1000 mol. wt., at
concentrations from about 30% to about 50%. Fused hybridoma cells are
selected by growth in hypoxanthine, thymidine and aminopterin in
supplemented Dulbecco's Modified Eagles Medium (DMEM) by procedures
known in the art. Supernatant fluids are collected from growth positive wells on
about days 14, 18, and 21, and are screened for antibody production by an
immunoassay such as solid phase immunoradioassay (SPIRA) using LMP as
the antigen. The culture fluids are also tested in the Ouchterlony precipitation
assay to determine the isotype of the mAb. Hybridoma cells from antibody
positive wells are cloned by a technique such as the soft agar technique of
MacPherson, "Soft Agar Techniques", in Tissue Culture Methods and
Applications, Kruse and Paterson (eds.), Academic Press (1973). See, also,
Harlow et al., Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory
(1988).

Monoclonal antibodies may also be produced in vivo by injection of
pristane-primed Balb/c mice, approximately 0.5 ml per mouse, with about
2 x 10^6 to about 6 x 10^6 hybridoma cells about 4 days after priming. Ascites fluid
is collected at approximately 8-12 days after cell transfer and the monoclonal
antibodies are purified by techniques known in the art.

In vitro production of anti-LMP mAb is carried out by growing the
hybridoma cell line in DMEM containing about 2% fetal calf serum to obtain
sufficient quantities of the specific mAb. The mAb are purified by techniques
known in the art.

Antibody titers of ascites or hybridoma culture fluids are determined by
various serological or immunological assays, which include, but are not limited
to, precipitation, passive agglutination, enzyme-linked immunosorbent assay
(ELISA) technique and radioimmunoassay (RIA) techniques. Similar assays
are used to detect the presence of the LMP in body fluids or tissue and cell
extracts.

It is readily apparent to those skilled in the art that the above-described
methods for producing monospecific antibodies may be utilized to produce
antibodies specific for polypeptide fragments of LMP, full-length nascent LMP
polypeptide, or variants or alleles thereof.

In another embodiment, the invention is directed to alternative splice
variants of HLMP-1. PCR analysis of human heart cDNA revealed mRNA for
two HLMP alternative splice variants, named HLMP-2 and HLMP-3, that differ
from HLMP-1 in a region between base pairs 325 and 444 in the HLMP-1
sequence. The HLMP-2 sequence has a 119 base pair deletion and an
insertion of 17 base pairs in this region. These changes preserve the reading
frame, resulting in a 423 amino acid protein, which, compared to HLMP-1, has a
net loss of 34 amino acids (40 amino acids deleted plus 6 inserted amino
acids). HLMP-2 contains the C-terminal LIM domains that are present in
HLMP-1.

Compared to HLMP-1, HLMP-3 has no deletions, but it does have the
same 17 base pair insertion at position 444. This insertion shifts the reading
frame, causing a stop codon at base pairs 459-461. As a result, HLMP-3
encodes a protein of 153 amino acids. This protein lacks the C-terminal LIM
domains that are present in HLMP-1 and HLMP-2. The predicted size of the
proteins encoded by HLMP-2 and HLMP-3 was confirmed by western blot
analysis.
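
The reading-frame statements for the two splice variants can be checked with simple arithmetic. The following is a minimal sketch only; the constant and function names are illustrative and are not part of the disclosure, and the 457 amino acid length of HLMP-1 is taken from Example 20.

```python
# Lengths stated in the text; HLMP1_LENGTH_AA is the value reported in Example 20.
HLMP1_LENGTH_AA = 457
DELETION_BP = 119
INSERTION_BP = 17


def net_change_bp(deleted: int, inserted: int) -> int:
    """Net change in transcript length in base pairs (negative means shorter)."""
    return inserted - deleted


def preserves_frame(net_bp: int) -> bool:
    """A net change that is a multiple of 3 leaves the downstream frame intact."""
    return net_bp % 3 == 0


# HLMP-2: 119 bp deleted and 17 bp inserted.
hlmp2_net = net_change_bp(DELETION_BP, INSERTION_BP)
hlmp2_length_aa = HLMP1_LENGTH_AA + hlmp2_net // 3

# HLMP-3: 17 bp inserted, nothing deleted.
hlmp3_net = net_change_bp(0, INSERTION_BP)

print(hlmp2_net, preserves_frame(hlmp2_net), hlmp2_length_aa)
print(hlmp3_net, preserves_frame(hlmp3_net))
```

A net change of 102 base pairs corresponds to 34 codons, which agrees with the stated 423 amino acid HLMP-2 protein, whereas a 17 base pair insertion alone is not a multiple of three and therefore shifts the frame.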

PCR analysis of the tissue distribution of the three splice variants
revealed that they are differentially expressed, with specific isoforms
predominating in different tissues. HLMP-1 is apparently the predominant form
expressed in leukocytes, spleen, lung, placenta, and fetal liver. HLMP-2
appears to be the predominant isoform in skeletal muscle, bone marrow, and
heart tissue. HLMP-3, however, was not the predominant isoform in any tissue
examined.

Overexpression of HLMP-3 in secondary rat osteoblast cultures
induced bone nodule formation (287±56) similar to the effect seen for
glucocorticoid (272±7) and HLMP-1 (232±200). Since HLMP-3 lacks the
C-terminal LIM domains, these regions are not required for osteoinductive activity.
Overexpression of HLMP-2, however, did not induce nodule formation (11±3).
These data suggest that the amino acids encoded by the deleted 119 base
pairs are necessary for osteoinduction. The data also suggest that the
distribution of HLMP splice variants may be important for tissue-specific
function. Surprisingly, we have shown that HLMP-2 inhibits steroid-induced
osteoblast formation in secondary rat osteoblast cultures. Therefore, HLMP-2
will have therapeutic utility in clinical situations where bone formation is not
desirable.

On July 22, 1997, a sample of 10-4/RLMP in a vector designated
pCMV2/RLMP (which is vector pRc/CMV2 with insert 10-4 clone/RLMP) was
deposited with the American Type Culture Collection (ATCC), 12301 Parklawn
Drive, Rockville, MD 20852. The culture accession number for that deposit is
209153. On March 19, 1998, a sample of the vector pHis-A with insert
HLMP-1s was deposited at the American Type Culture Collection ("ATCC"). The
culture accession number for that deposit is 209698. On April 14, 2000,
samples of plasmids pHAhLMP-2 (vector pHisA with cDNA insert derived from
human heart muscle cDNA with HLMP-2) and pHAhLMP-3 (vector pHisA with
cDNA insert derived from human heart muscle cDNA with HLMP-3) were
deposited with the ATCC, 10801 University Blvd., Manassas, VA, 20110-2209,
USA, under the conditions of the Budapest Treaty. The accession numbers for
these deposits are ______ and ______, respectively. These deposits, as
required by the Budapest Treaty, will be maintained in the ATCC for at least 30
years and will be made available to the public upon the grant of a patent
disclosing them. It should be understood that the availability of a deposit does
not constitute a license to practice the subject invention in derogation of patent
rights granted by government action.

In assessing the nucleic acids, proteins, or antibodies of the invention,
enzyme assays, protein purification, and other conventional biochemical
methods are employed. DNA and RNA are analyzed by Southern blotting and
Northern blotting techniques, respectively. Typically, the samples analyzed are
size fractionated by gel electrophoresis. The DNA or RNA in the gels is then
transferred to nitrocellulose or nylon membranes. The blots, which are replicas
of sample patterns in the gels, are then hybridized with probes. Typically, the
probes are radiolabeled, preferably with 32P, although one could label the
probes with other signal-generating molecules known to those in the art.
Specific bands of interest can then be visualized by detection systems, such as
autoradiography.

For purposes of illustrating preferred embodiments of the present
invention, the following, non-limiting examples are included. These results
demonstrate the feasibility of inducing or enhancing the formation of bone
using the LIM mineralization proteins of the invention, and the isolated nucleic
acid molecules encoding those proteins.

EXAMPLE 1: Calvarial Cell Culture

Rat calvarial cells, also known as rat osteoblasts ("ROB"), were obtained
from 20-day pre-parturition rats as previously described. Boden et al.,
Endocrinology, 137(8):3401-07 (1996). Primary cultures were grown to
confluence (7 days), trypsinized, and passed into 6-well plates (1 x 10^5 cells/35
mm well) as first subculture cells. The subculture cells, which were confluent
at day 0, were grown for an additional 7 days. Beginning on day 0, media were
changed and treatments (Trm and/or BMPs) were applied, under a laminar flow
hood, every 3 or 4 days. The standard culture protocol was as follows: days
1-7, MEM, 10% FBS, 50 µg/ml ascorbic acid, ± stimulus; days 8-14, BGJb
medium, 10% FBS, 5 mM β-GlyP (as a source of inorganic phosphate to permit
mineralization). Endpoint analysis of bone nodule formation and osteocalcin
secretion was performed at day 14. The dose of BMP was chosen as 50 ng/ml
based on pilot experiments in this system that demonstrated a mid-range effect
on the dose-response curve for all BMPs studied.



EXAMPLE 2: Antisense Treatment and Cell Culture 

To explore the potential functional role of LMP-1 during membranous
bone formation, we synthesized an antisense oligonucleotide to block LMP-1
mRNA translation and treated secondary osteoblast cultures that were
undergoing differentiation initiated by glucocorticoid. Inhibition of RLMP
expression was accomplished with a highly specific antisense oligonucleotide
(having no significant homologies to known rat sequences) corresponding to a
25 bp sequence spanning the putative translational start site (SEQ ID NO: 42).
Control cultures either did not receive oligonucleotide or they received sense
oligonucleotide. Experiments were performed in the presence (preincubation)
and absence of lipofectamine. Briefly, 22 µg of sense or antisense RLMP
oligonucleotide was incubated in MEM for 45 minutes at room temperature.
Following that incubation, either more MEM or pre-incubated
lipofectamine/MEM (7% v/v; incubated 45 minutes at room temperature) was
added to achieve an oligonucleotide concentration of 0.2 µM. The resulting
mixture was incubated for 15 minutes at room temperature. Oligonucleotide
mixtures were then mixed with the appropriate medium, that is,
MEM/Ascorbate/±Trm, to achieve a final oligonucleotide concentration of 0.1
µM.
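
The relationship between the oligonucleotide mass and the stated working concentrations can be estimated as follows. This is a rough sketch under a stated assumption that is not in the text: an average mass of about 330 g/mol per nucleotide for single-stranded DNA. The function names are illustrative only.

```python
# Assumption (not from the text): ~330 g/mol per nucleotide for single-stranded DNA.
AVG_NT_MASS_G_PER_MOL = 330.0


def oligo_nmol(mass_ug: float, length_nt: int) -> float:
    """Approximate amount of oligonucleotide in nanomoles."""
    molar_mass = AVG_NT_MASS_G_PER_MOL * length_nt  # g/mol
    return mass_ug * 1e-6 / molar_mass * 1e9        # µg -> g -> mol -> nmol


def volume_ml(amount_nmol: float, conc_um: float) -> float:
    """Volume in ml that holds amount_nmol at conc_um (1 µM = 1 nmol/ml)."""
    return amount_nmol / conc_um


amount = oligo_nmol(22.0, 25)           # 22 µg of a 25-mer
intermediate_ml = volume_ml(amount, 0.2)
final_ml = volume_ml(amount, 0.1)

print(round(amount, 2), round(intermediate_ml, 1), round(final_ml, 1))
```

Under this assumption, halving the concentration from 0.2 µM to 0.1 µM simply doubles the volume, which is consistent with mixing the oligonucleotide mixture with an equal volume of medium.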

Cells were incubated with the appropriate medium (±stimulus) in the
presence or absence of the appropriate oligonucleotides. Cultures originally
incubated with lipofectamine were re-fed after 4 hours of incubation (37°C; 5%
CO2) with media containing neither lipofectamine nor oligonucleotide. All
cultures, especially cultures receiving oligonucleotide, were re-fed every 24
hours to maintain oligonucleotide levels.

LMP-1 antisense oligonucleotide inhibited mineralized nodule formation
and osteocalcin secretion in a dose-dependent manner, similar to the effect of
BMP-6 oligonucleotide. The LMP-1 antisense block in osteoblast
differentiation could not be rescued by addition of exogenous BMP-6, while the
BMP-6 antisense oligonucleotide inhibition was reversed with addition of
BMP-6. This experiment further confirmed the upstream position of LMP-1 relative
to BMP-6 in the osteoblast differentiation pathway. LMP-1 antisense
oligonucleotide also inhibited spontaneous osteoblast differentiation in primary
rat osteoblast cultures.



EXAMPLE 3: Quantitation of Mineralized Bone Nodule Formation 

Cultures of ROBs prepared according to Examples 1 and 2 were fixed
overnight in 70% ethanol and stained with von Kossa silver stain. A
semi-automated computerized video image analysis system was used to quantitate
nodule count and nodule area in each well. Boden et al., Endocrinology,
137(8):3401-07 (1996). These values were then divided to calculate the area
per nodule values. This automated process was validated against a manual
counting technique and demonstrated a correlation coefficient of 0.92
(p < 0.000001). All data are expressed as the mean ± standard error of the
mean (S.E.M.) calculated from 5 or 6 wells at each condition. Each experiment
was confirmed at least twice using cells from different calvarial preparations.
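
The summary statistics described above (area per nodule, and mean ± S.E.M. across replicate wells) can be sketched as follows. The well values are hypothetical and serve only to illustrate the calculation; the function names are not taken from the image analysis system used.

```python
import math
import statistics


def mean_sem(values):
    """Return (mean, standard error of the mean) for replicate wells."""
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))  # sample SD / sqrt(n)
    return mean, sem


def area_per_nodule(total_area, nodule_count):
    """Nodule area divided by nodule count for one well."""
    return total_area / nodule_count


# Hypothetical nodule counts from six wells (illustrative values only).
counts = [10, 12, 14, 16, 18, 20]
mean, sem = mean_sem(counts)
print(f"{mean:.1f} ± {sem:.2f}")
```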

EXAMPLE 4: Quantitation of Osteocalcin Secretion 



Osteocalcin levels in the culture media were measured using a
competitive radioimmunoassay with a monospecific polyclonal antibody (PAb)
raised in our laboratory against the C-terminal nonapeptide of rat osteocalcin
as described in Nanes et al., Endocrinology, 127:588 (1990). Briefly, 1 µg of
nonapeptide was iodinated with 1 mCi 125I-Na by the lactoperoxidase method.
Tubes containing 200 µl of assay buffer (0.02 M sodium phosphate, 1 mM
EDTA, 0.001% thimerosal, 0.025% BSA) received media taken from cell
cultures or osteocalcin standards (0 - 12,000 fmole) at 100 µl/tube in assay
buffer. The PAb (1:40,000; 100 µl) was then added, followed by the iodinated
peptide (12,000 cpm; 100 µl). Samples tested for non-specific binding were
prepared similarly but contained no antibody.

Bound and free PAbs were separated by the addition of 700 µl goat
anti-rabbit IgG, followed by incubation for 18 hours at 4°C. After samples were
centrifuged at 1200 rpm for 45 minutes, the supernatants were decanted and
the precipitates counted in a gamma counter. Osteocalcin values were
reported in fmole/100 µl, which was then converted to pmole/ml medium (3-day
production) by dividing those values by 100. Values were expressed as the
mean ± S.E.M. of triplicate determinations for 5-6 wells for each condition.
Each experiment was confirmed at least two times using cells from different
calvarial preparations.
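
The unit conversion stated above (fmole/100 µl to pmole/ml by dividing by 100) follows from two steps, shown here as a short sketch. The function name and the sample reading are illustrative only.

```python
def fmol_per_100ul_to_pmol_per_ml(value: float) -> float:
    """Convert an osteocalcin reading from fmole/100 µl to pmole/ml."""
    fmol_per_ml = value * 10      # 1 ml contains ten 100 µl aliquots
    return fmol_per_ml / 1000     # 1000 fmole = 1 pmole


# Hypothetical reading, for illustration only.
reading = 1200.0
converted = fmol_per_100ul_to_pmol_per_ml(reading)
print(converted)
```

Multiplying by 10 and dividing by 1000 is the same as dividing by 100, as the text states.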

EXAMPLE 5: Effect of Trm and RLMP on Mineralization In Vitro 

There was little apparent effect of either the sense or antisense
oligonucleotides on the overall production of bone nodules in the
non-stimulated cell culture system. When ROBs were stimulated with Trm,
however, the antisense oligonucleotide to RLMP inhibited mineralization of
nodules by > 95%. The addition of exogenous BMP-6 to the
oligonucleotide-treated cultures did not rescue the mineralization of
RLMP-antisense-treated nodules.

Osteocalcin has long been synonymous with bone mineralization, and
osteocalcin levels have been correlated with nodule production and
mineralization. The RLMP-antisense oligonucleotide significantly decreases
osteocalcin production, but the nodule count in antisense-treated cultures does
not change significantly. In this case, the addition of exogenous BMP-6 only
rescued the production of osteocalcin in RLMP-antisense-treated cultures by
10-15%. This suggests that the action of RLMP is downstream of, and more
specific than, BMP-6.

EXAMPLE 6: Harvest and Purification of RNA

Cellular RNA from duplicate wells of ROBs (prepared according to
Examples 1 and 2 in 6-well culture dishes) was harvested using 4M guanidine
isothiocyanate (GIT) solution to yield statistical triplicates. Briefly, culture
supernatant was aspirated from the wells, which were then overlaid with 0.6
ml of GIT solution per duplicate well harvest. After adding the GIT solution, the
plates were swirled for 5-10 seconds (being as consistent as possible).
Samples were saved at -70°C for up to 7 days before further processing.

RNA was purified by a slight modification of standard methods
according to Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd
Ed., chapter 7.19, Cold Spring Harbor Press (1989). Briefly, thawed samples
received 60 µl 2.0 M sodium acetate (pH 4.0), 550 µl phenol (water saturated)
and 150 µl chloroform:isoamyl alcohol (49:1). After vortexing, the samples were
centrifuged (10000 x g; 20 minutes; 4°C), the aqueous phase transferred to a
fresh tube, 600 µl isopropanol was added and the RNA precipitated overnight at
-20°C.

Following the overnight incubation, the samples were centrifuged
(10000 x g; 20 minutes) and the supernatant was aspirated gently. The pellets
were resuspended in 400 µl DEPC-treated water, extracted once with
phenol:chloroform (1:1), extracted with chloroform:isoamyl alcohol (24:1) and
precipitated overnight at -20°C after addition of 40 µl sodium acetate (3.0 M; pH
5.2) and 1.0 ml absolute ethanol. To recover the cellular RNA, the samples
were centrifuged (10000 x g; 20 min), washed once with 70% ethanol, air dried
for 5-10 minutes and resuspended in 20 µl of DEPC-treated water. RNA
concentrations were calculated from optical densities that were determined
with a spectrophotometer.

EXAMPLE 7: Reverse Transcription-Polymerase Chain Reaction

Heated total RNA (5 µg in 10.5 µl total volume DEPC-H2O at 65°C for 5
minutes) was added to tubes containing 4 µl 5X MMLV-RT buffer, 2 µl dNTPs, 2 µl
dT17 primer (10 pmol/ml), 0.5 µl RNAsin (40 U/ml) and 1 µl MMLV-RT (200
units/µl). The samples were incubated at 37°C for 1 hour, then at 95°C for 5
minutes to inactivate the MMLV-RT. The samples were diluted by addition of
80 µl of water.

Reverse-transcribed samples (5 µl) were subjected to polymerase-chain
reaction using standard methodologies (50 µl total volume). Briefly, samples
were added to tubes containing water and appropriate amounts of PCR buffer,
25 mM MgCl2, dNTPs, forward and reverse primers for glyceraldehyde
3-phosphate dehydrogenase (GAP, a housekeeping gene) and/or BMP-6,
32P-dCTP, and Taq polymerase. Unless otherwise noted, primers were
standardized to run consistently at 22 cycles (94°C, 30 sec; 58°C, 30 sec; 72°C, 20 sec).
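
The volumes given above imply how much of the original RNA is represented in each PCR tube. The following sketch works through that arithmetic using only the volumes stated in this example; the variable names are illustrative, and the cycling time counts hold steps only (ramp times are ignored as a simplifying assumption).

```python
# Volumes in µl, as listed in the text for one reverse-transcription reaction.
rt_components_ul = {
    "RNA in DEPC-H2O": 10.5,
    "5X MMLV-RT buffer": 4.0,
    "dNTPs": 2.0,
    "dT17 primer": 2.0,
    "RNAsin": 0.5,
    "MMLV-RT": 1.0,
}
RNA_INPUT_UG = 5.0
WATER_ADDED_UL = 80.0
PCR_TEMPLATE_UL = 5.0

rt_volume_ul = sum(rt_components_ul.values())        # reaction volume before dilution
diluted_volume_ul = rt_volume_ul + WATER_ADDED_UL    # volume after adding water
fraction_used = PCR_TEMPLATE_UL / diluted_volume_ul  # share of cDNA per PCR tube
rna_equivalent_ug = RNA_INPUT_UG * fraction_used     # input RNA represented per tube

# 22 cycles of 30 s + 30 s + 20 s (hold times only; ramping is ignored).
cycling_seconds = 22 * (30 + 30 + 20)

print(rt_volume_ul, diluted_volume_ul, fraction_used, rna_equivalent_ug, cycling_seconds)
```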

EXAMPLE 8: Quantitation of RT-PCR Products by Polyacrylamide Gel
Electrophoresis (PAGE) and PhosphorImager Analysis

RT-PCR products received 5 µl/tube loading dye, were mixed, heated at
65°C for 10 min and centrifuged. Ten µl of each reaction was subjected to
PAGE (12% polyacrylamide:bis; 15 V/well; constant current) under standard
conditions. Gels were then incubated in gel preserving buffer (10% v/v
glycerol, 7% v/v acetic acid, 40% v/v methanol, 43% deionized water) for 30
minutes, dried (80°C) in vacuo for 1-2 hours and developed with an
electronically-enhanced phosphorescence imaging system for
6-24 hours. Visualized bands were analyzed. Counts per band were plotted
graphically.

EXAMPLE 9: Differential Display PCR

RNA was extracted from cells stimulated with glucocorticoid (Trm, 1
nM). Heated, DNase-treated total RNA (5 µg in 10.5 µl total volume in
DEPC-H2O at 65°C for 5 minutes) was reverse transcribed as described in Example 7,
but H-T11M (SEQ ID NO: 4) was used as the MMLV-RT primer. The resulting
cDNAs were PCR-amplified as described above, but with various commercial
primer sets (for example, H-T11G (SEQ ID NO: 4) and H-AP-10 (SEQ ID NO: 5);
GenHunter Corp, Nashville, TN). Radiolabeled PCR products were fractionated by gel
electrophoresis on a DNA sequencing gel. After electrophoresis, the resulting
gels were dried in vacuo and autoradiographs were exposed overnight. Bands
representing differentially-expressed cDNAs were excised from the gel and
reamplified by PCR using the method of Conner et al., Proc. Natl. Acad. Sci.
USA, 88:278 (1983). The products of PCR reamplification were cloned into
the vector PCR-II (TA cloning kit; InVitrogen, Carlsbad, CA).

EXAMPLE 10: Screening of a UMR 106 Rat Osteosarcoma Cell cDNA Library 

A UMR 106 library (2.5 x 10^10 pfu/ml) was plated at 5 x 10^4 pfu/ml onto
agar plates (LB bottom agar) and the plates were incubated overnight at 37°C.
Filter membranes were overlaid onto plates for two minutes. Once removed,
the filters were denatured, rinsed, dried and UV cross-linked. The filters were
then incubated in pre-hybridization buffer (2X PIPES [pH 6.5], 5% formamide,
1% SDS and 100 µg/ml denatured salmon sperm DNA) for 2 h at 42°C. A 260
base-pair radiolabeled probe (SEQ ID NO: 3; 32P labelled by random priming)
was added to the entire hybridization mix/filters, followed by hybridization for 18
hours at 42°C. The membranes were washed once at room temperature (10
min, 1 x SSC, 0.1% SDS) and three times at 55°C (15 min, 0.1 x SSC, 0.1%
SDS).

After they were washed, the membranes were analyzed by
autoradiography as described above. Positive clones were plaque purified.
The procedure was repeated with a second filter for four minutes to minimize
spurious positives. Plaque-purified clones were rescued as lambda SK(-)
phagemids. Cloned cDNAs were sequenced as described below.
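
The library titer and plating density given at the start of this example imply a fixed dilution of the stock. A minimal sketch of that calculation follows; the function name is illustrative, and how the dilution was actually performed is not stated in the text.

```python
def dilution_factor(stock_titer: float, target_titer: float) -> float:
    """Fold dilution needed to bring a stock titer down to a target titer."""
    if target_titer <= 0 or stock_titer < target_titer:
        raise ValueError("target titer must be positive and no greater than the stock")
    return stock_titer / target_titer


# Titers in pfu/ml, as stated in the text.
fold = dilution_factor(2.5e10, 5e4)
print(f"{fold:.0f}-fold dilution")
```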

EXAMPLE 11: Sequencing of Clones 

Cloned cDNA inserts were sequenced by standard methods. Ausubel et
al., Current Protocols in Molecular Biology, Wiley Interscience (1988). Briefly,
appropriate concentrations of termination mixture, template and reaction
mixture were subjected to an appropriate cycling protocol (95°C, 30s; 68°C, 30s;
72°C, 60s; x 25). Stop mixture was added to terminate the sequencing
reactions. After heating at 92°C for 3 minutes, the samples were loaded onto a
denaturing 6% polyacrylamide sequencing gel (29:1 acrylamide:bis-acrylamide).
Samples were electrophoresed for about 4 hours at 60 volts,
constant current. After electrophoresis, the gels were dried in vacuo and
autoradiographed.

The autoradiographs were analyzed manually. The resulting sequences
were screened against the databases maintained by the National Center for
Biotechnology Information (NIH, Bethesda, MD; http://www.ncbi.nlm.nih.gov/)
using the BLASTn program set with default parameters. Based on the
sequence data, new sequencing primers were prepared and the process was
repeated until the entire gene had been sequenced. All sequences were
confirmed a minimum of three times in both orientations.

Nucleotide and amino acid sequences were also analyzed using the
PCGENE software package (version 16.0). Per cent homology values for
nucleotide sequences were calculated by the program NALIGN, using the
following parameters: weight of non-matching nucleotides, 10; weight of
non-matching gaps, 10; maximum number of nucleotides considered, 50; and
minimum number of nucleotides considered, 50.

For amino acid sequences, per cent homology values were calculated
using PALIGN. A value of 10 was selected for both the open gap cost and the
unit gap cost.
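
For readers unfamiliar with per cent homology values, the sketch below shows one simple way to score two sequences that have already been aligned. It is not the NALIGN or PALIGN algorithm, whose internal scoring is not described here; it is a plain percent-identity count, and the example sequences are made up for illustration.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns with identical residues.

    Columns where both sequences have a gap are ignored; a residue opposite
    a gap counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b) if not (a == gap and b == gap)
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# Made-up aligned sequences, for illustration only.
identity = percent_identity("ATGC-TGA", "ATGCATCA")
print(identity)
```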

EXAMPLE 12: Cloning of RLMP cDNA

The differential display PCR amplification products described in
Example 9 contained a major band of approximately 260 base pairs. This
sequence was used to screen a rat osteosarcoma (UMR 106) cDNA library.
Positive clones were subjected to nested primer analysis to obtain the primer
sequences necessary for amplifying the full length cDNA (SEQ. ID NOs: 11,
12, 29, 30 and 31). One of those positive clones selected for further study was
designated clone 10-4.

Sequence analysis of the full-length cDNA in clone 10-4, determined by
nested primer analysis, showed that clone 10-4 contained the original 260
base-pair fragment identified by differential display PCR. Clone 10-4 (1696
base pairs; SEQ ID NO: 2) contains an open reading frame of 1371 base pairs
encoding a protein having 457 amino acids (SEQ ID NO: 1). The termination
codon, TGA, occurs at nucleotides 1444-1446. The polyadenylation signal at
nucleotides 1675-1680, and adjacent poly(A)+ tail, was present in the 3'
noncoding region. There were two potential N-glycosylation sites, Asn-Lys-Thr
and Asn-Arg-Thr, at amino acid positions 113-116 and 257-259 in SEQ ID NO:
1, respectively. Two potential cAMP- and cGMP-dependent protein kinase
phosphorylation sites, Ser and Thr, were found at amino acid positions 191
and 349, respectively. There were five potential protein kinase C
phosphorylation sites, Ser or Thr, at amino acid positions 3, 115, 166, 219 and
442. One potential ATP/GTP binding site motif A (P-loop),
Gly-Gly-Ser-Asn-Asn-Gly-Lys-Thr, was determined at amino acid positions 272-279.

In addition, two highly conserved putative LIM domains were found at
amino acid positions 341-391 and 400-451. The putative LIM domains in this
newly identified rat cDNA clone showed considerable homology with the LIM
domains of other known LIM proteins. However, the overall homology with
other rat LIM proteins was less than 25%. RLMP (also designated 10-4) has
78.5% amino acid homology to the human enigma protein (see U.S. Patent No.
5,504,192), but only 24.5% and 22.7% amino acid homology to its closest rat
homologs, CLP-36 and RIT-18, respectively.
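
The coordinates reported for clone 10-4 are mutually consistent, as the following sketch shows. It assumes, which the text does not state explicitly, that the 1371 base pair open reading frame ends immediately before the TGA codon and does not include it; the derived start position is therefore an inference, not a reported value.

```python
# Values stated in the text for clone 10-4 (1-based nucleotide positions).
CLONE_LENGTH_BP = 1696
ORF_LENGTH_BP = 1371
STOP_CODON_START = 1444
STOP_CODON_END = 1446

# An ORF of whole codons should divide evenly by three.
codons, remainder = divmod(ORF_LENGTH_BP, 3)

# Assumption: the ORF ends at the nucleotide just before the stop codon.
orf_end = STOP_CODON_START - 1
orf_start = orf_end - ORF_LENGTH_BP + 1

# Sequence remaining after the stop codon (3' noncoding region).
three_prime_noncoding_bp = CLONE_LENGTH_BP - STOP_CODON_END

print(codons, remainder, orf_start, orf_end, three_prime_noncoding_bp)
```

The 457 codons match the stated protein length, and the polyadenylation signal at nucleotides 1675-1680 falls within the noncoding sequence that follows the stop codon.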

EXAMPLE 13: Northern Blot Analysis of RLMP Expression 

Thirty µg of total RNA from ROBs, prepared according to Examples 1 and
2, was size fractionated by formaldehyde gel electrophoresis in 1% agarose
flatbed gels and osmotically transblotted to nylon membranes. The blot was
probed with a 600 base pair EcoR1 fragment of full-length 10-4 cDNA labeled
with 32P-dCTP by random priming.

Northern blot analysis showed a 1.7 kb mRNA species that hybridized
with the RLMP probe. RLMP mRNA was up-regulated approximately 3.7-fold
in ROBs after 24 hours exposure to BMP-6. No up-regulation of RLMP
expression was seen in BMP-2 or BMP-4-stimulated ROBs at 24 hours.

EXAMPLE 14: Statistical Methods 

For each reported nodule/osteocalcin result, data from 5-6 wells from a
representative experiment were used to calculate the mean ± S.E.M. Graphs
may be shown with data normalized to the maximum value for each parameter
to allow simultaneous graphing of nodule counts, mineralized areas and
osteocalcin.

For each reported RT-PCR, RNase protection assay or Western blot
analysis, data from triplicate samples of representative experiments were used
to determine the mean ± S.E.M. Graphs may be shown normalized to either
day 0 or negative controls and expressed as fold-increase above control
values.

Statistical significance was evaluated using a one-way analysis of
variance with post-hoc multiple comparison corrections of Bonferroni as
appropriate. D. V. Huntsberger, "The Analysis of Variance," in Elements of
Statistical Variance, P. Billingsley (ed.), pp. 298-330, Allyn & Bacon Inc.,
Boston, MA (1977) and Sigmastat, Jandel Scientific, Corte Madera, CA. Alpha
levels for significance were defined as p < 0.05.
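
The statistical approach named above can be illustrated with a short sketch. This is not the Sigmastat implementation; it computes the one-way ANOVA F statistic from first principles and the Bonferroni-adjusted alpha for a given number of post-hoc comparisons. The group values are hypothetical, and the p-value lookup against the F distribution is omitted.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a list of sample groups."""
    all_values = [v for g in groups for v in g]
    grand_mean = mean(all_values)
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    ss_within = sum((v - mean(g)) ** 2 for g in groups for v in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


def bonferroni_alpha(alpha, n_comparisons):
    """Per-comparison significance threshold under the Bonferroni correction."""
    return alpha / n_comparisons


# Hypothetical measurements for three treatment groups (illustrative only).
groups = [[1, 2, 3], [2, 3, 4], [5, 6, 7]]
f_stat, df_b, df_w = one_way_anova_f(groups)
adjusted = bonferroni_alpha(0.05, 3)
print(f_stat, df_b, df_w, adjusted)
```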

EXAMPLE 15: Detection of Rat LIM Mineralization Protein by Western Blot 
Analysis 

Polyclonal antibodies were prepared according to the methods of
England et al., Biochim. Biophys. Acta, 623:171 (1980) and Timmer et al., J.
Biol. Chem., 268:24863 (1993).

HeLa cells were transfected with pCMV2/RLMP. Protein was harvested
from the transfected cells according to the method of Hair et al., Leukemia
Research, 20:1 (1996). Western Blot Analysis of native RLMP was performed
as described by Towbin et al., Proc. Natl. Acad. Sci. USA, 76:4350 (1979).

EXAMPLE 16: Synthesis of the Rat LMP-Unique (RLMPU) derived Human
PCR product

Based on the sequence of the rat LMP-1 cDNA, forward and reverse
PCR primers (SEQ ID NOs: 15 and 16) were synthesized and a unique 223
base-pair sequence was PCR amplified from the rat LMP-1 cDNA. A similar
PCR product was isolated from human MG63 osteosarcoma cell cDNA with the
same PCR primers.

RNA was harvested from MG63 osteosarcoma cells grown in T-75
flasks. Culture supernatant was removed by aspiration and the flasks were
overlaid with 3.0 ml of GIT solution per duplicate, swirled for 5-10 seconds,
and the resulting solution was transferred to 1.5 ml eppendorf tubes (5 tubes
with 0.6 ml/tube). RNA was purified by a slight modification of standard
methods, for example, see Sambrook et al., Molecular Cloning: A Laboratory
Manual, chapter 7, page 19, Cold Spring Harbor Laboratory Press (1989) and
Boden et al., Endocrinology, 138:2820-28 (1997). Briefly, the 0.6 ml samples
received 60 µl 2.0 M sodium acetate (pH 4.0), 550 µl water saturated phenol and
150 µl chloroform:isoamyl alcohol (49:1). After addition of those reagents, the
samples were vortexed, centrifuged (10000 x g; 20 min; 4°C) and the aqueous
phase transferred to a fresh tube. Isopropanol (600 µl) was added and the RNA
was precipitated overnight at -20°C. The samples were centrifuged (10000 x
g; 20 minutes) and the supernatant was aspirated gently. The pellets were
resuspended in 400 µl of DEPC-treated water, extracted once with
phenol:chloroform (1:1), extracted with chloroform:isoamyl alcohol (24:1) and
precipitated overnight at -20°C in 40 µl sodium acetate (3.0 M; pH 5.2) and 1.0
ml absolute ethanol. After precipitation, the samples were centrifuged (10000
x g; 20 min), washed once with 70% ethanol, air dried for 5-10 minutes and
resuspended in 20 µl of DEPC-treated water. RNA concentrations were derived
from optical densities.

Total RNA (5 µg in 10.5 µl total volume in DEPC-H2O) was heated at 65°C
for 5 minutes, and then added to tubes containing 4 µl 5X MMLV-RT buffer, 2 µl
dNTPs, 2 µl dT17 primer (10 pmol/ml), 0.5 µl RNAsin (40 U/ml) and 1 µl MMLV-RT
(200 units/µl). The reactions were incubated at 37°C for 1 hour. Afterward, the
MMLV-RT was inactivated by heating at 95°C for 5 minutes. The samples
were diluted by addition of 80 µl water.

Transcribed samples (5 µl) were subjected to polymerase-chain reaction
using standard methodologies (50 µl total volume). Boden et al., Endocrinology,
138:2820-28 (1997); Ausubel et al., "Quantitation of rare DNAs by the
polymerase chain reaction", in Current Protocols in Molecular Biology, chapter
15.31-1, Wiley & Sons, Trenton, NJ (1990). Briefly, samples were added to
tubes containing water and appropriate amounts of PCR buffer, 25 mM MgCl2,
dNTPs, forward and reverse primers (for RLMPU; SEQ ID NOs: 15 and 16),
32P-dCTP, and DNA polymerase. Primers were designed to run consistently at
22 cycles for radioactive band detection and 33 cycles for amplification of PCR
product for use as a screening probe (94°C, 30 sec; 58°C, 30 sec; 72°C, 20
sec).

Sequencing of the agarose gel-purified MG63 osteosarcoma-derived 
PCR product gave a sequence more than 95% homologous to the RLMPU 
PCR product. That sequence is designated HLMP unique region (HLMPU; 
SEQ ID NO: 6). 

EXAMPLE 17: Screening of reverse-transcriptase-derived MG63 cDNA 



Screening was performed with PCR using specific primers (SEQ ID
NOs: 16 and 17) as described in Example 7. A 717 base-pair MG63 PCR
product was agarose gel purified and sequenced with the given primers (SEQ.
ID NOs: 12, 15, 16, 17, 18, 27 and 28). Sequences were confirmed a
minimum of two times in both directions. The MG63 sequences were aligned
against each other and then against the full-length rat LMP cDNA sequence to
obtain a partial human LMP cDNA sequence (SEQ ID NO: 7).



EXAMPLE 18: Screening of a Human Heart cDNA Library 



Based on Northern blot experiments, it was determined that LMP-1 is
expressed at different levels by several different tissues, including human heart
muscle. A human heart cDNA library was therefore examined. The library was
plated at 5 x 10^4 pfu/ml onto agar plates (LB bottom agar) and plates were
grown overnight at 37°C. Filter membranes were overlaid onto the plates for
two minutes. Afterward, the filters were denatured, rinsed, dried, UV cross-linked
and incubated in pre-hybridization buffer (2X PIPES [pH 6.5]; 5% formamide,
1% SDS, 100 µg/ml denatured salmon sperm DNA) for 2 h at 42°C. A
radiolabeled, LMP-unique, 223 base-pair probe (32P, random primer labelling;
SEQ ID NO: 6) was added and hybridized for 18 h at 42°C. Following
hybridization, the membranes were washed once at room temperature (10 min,
1 x SSC, 0.1% SDS) and three times at 55°C (15 min, 0.1 x SSC, 0.1% SDS).
Double-positive plaque-purified heart library clones, identified by
autoradiography, were rescued as lambda phagemids according to the
manufacturer's protocols (Stratagene, La Jolla, CA).

Restriction digests of positive clones yielded cDNA inserts of varying
sizes. Inserts greater than 600 base-pairs in length were selected for initial
screening by sequencing. Those inserts were sequenced by standard
methods as described in Example 11.

One clone, number 7, was also subjected to automated sequence
analysis using primers corresponding to SEQ ID NOs: 11-14, 16 and 27. The
sequences obtained by these methods were routinely 97-100% homologous.
Clone 7 (Partial Human LMP-1 cDNA from a heart library; SEQ. ID NO: 8)
contained sequence that was more than 87% homologous to the rat LMP
cDNA sequence in the translated region.

EXAMPLE 19: Determination of Full-Length Human LMP-1 cDNA 

Overlapping regions of the MG63 human osteosarcoma cell cDNA
sequence and the human heart cDNA clone 7 sequence were used to align
those two sequences and derive a complete human cDNA sequence of 1644
base-pairs. NALIGN, a program in the PCGENE software package, was used
to align the two sequences. The overlapping regions of the two sequences
constituted approximately 360 base-pairs having complete homology except for
a single nucleotide substitution at nucleotide 672 in the MG63 cDNA (SEQ ID
NO: 7) with clone 7 having an "A" instead of a "G" at the corresponding
nucleotide 516 (SEQ ID NO: 8).

The two aligned sequences were joined using SEQIN, another
subprogram of PCGENE, using the "G" substitution of the MG63 osteosarcoma
cDNA clone. The resulting sequence is shown in SEQ ID NO: 9. Alignment of
the novel human-derived sequence with the rat LMP-1 cDNA was
accomplished with NALIGN. The full-length human LMP-1 cDNA sequence
(SEQ. ID NO: 9) is 87.3% homologous to the translated portion of rat LMP-1
cDNA sequence.
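
The joining and homology steps of this example can be illustrated with a minimal Python sketch. It is not the PCGENE NALIGN or SEQIN implementation; the function names and the toy sequences are hypothetical, and the overlap length is assumed to be known from a prior alignment. The sketch computes percent identity over an aligned overlap and joins two fragments, taking the bases of a chosen fragment wherever the overlap disagrees.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two equal-length aligned sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def join_overlapping(left: str, right: str, overlap: int, prefer_left: bool = True) -> str:
    """Join two fragments whose last/first `overlap` bases were aligned.

    Where the overlap disagrees, bases come from `left` if prefer_left
    is True, otherwise from `right`.
    """
    if not 0 < overlap <= min(len(left), len(right)):
        raise ValueError("overlap must fit inside both fragments")
    if prefer_left:
        return left + right[overlap:]
    return left[:-overlap] + right


# Toy data (hypothetical): a 12-base overlap with one G/A mismatch at its end.
left = "TTTTTT" + "ACGTACGTACGG"
right = "ACGTACGTACGA" + "CCCCCC"
identity = percent_identity(left[-12:], right[:12])
joined = join_overlapping(left, right, overlap=12, prefer_left=True)
print(round(identity, 1), joined)
```

Applied to an overlap of 360 bases with a single substitution, `percent_identity` gives 359/360, or about 99.7%.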

EXAMPLE 20: Determination of Amino Acid Sequence of Human LMP-1 

The putative amino acid sequence of human LMP-1 was determined
with the PCGENE subprogram TRANSL. The open reading frame in SEQ ID
NO: 9 encodes a protein comprising 457 amino acids (SEQ. ID NO: 10). Using
the PCGENE subprogram Palign, the human LMP-1 amino acid sequence was
found to be 94.1% homologous to the rat LMP-1 amino acid sequence.
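
Conceptual translation of an open reading frame can be sketched as follows. This is an illustrative stand-in for a program such as TRANSL, not its actual implementation: it assumes the standard genetic code, assumes that the first ATG in the sequence is the start codon, and uses a hypothetical function name. The test fragment is the first 75 nucleotides of SEQ ID NO: 37 as printed in Example 31.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_first_orf(cdna: str) -> str:
    """Translate from the first ATG to the first in-frame stop (or the end)."""
    seq = cdna.upper().replace(" ", "")
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unreadable codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


fragment = (
    "CGACGCAGAG CAGCGCCCTG GCCGGGCCAA GCAGGAGCCG GCATCATGGA TTCCTTCAAG "
    "GTAGTGCTGG AGGGG"
)
print(translate_first_orf(fragment))
```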

EXAMPLE 21: Determination of the 5 Prime Untranslated Region of the
Human LMP cDNA

MG63 5' cDNA was amplified by nested RT-PCR of MG63 total RNA
using a 5' rapid amplification of cDNA ends (5' RACE) protocol. This method
included first strand cDNA synthesis using a lock-docking oligo (dT) primer with
two degenerate nucleotide positions at the 3' end (Chenchik et al.,
CLONTECHniques. X:5 (1995); Borson et al., PCR Methods Applic. 2:144
(1993)). Second-strand synthesis was performed according to the method of
Gubler et al., Gene. 25:263 (1983), with a cocktail of Escherichia coli DNA
polymerase I, RNase H, and E. coli DNA ligase. After creation of blunt ends
with T4 DNA polymerase, double-stranded cDNA was ligated to the fragment
(5'-CTAATACGACTCACTATAGGGCTCGAGCGGCCGCCCGGGCAGGT-3')
(SEQ. ID NO: 19). Prior to RACE, the adaptor-ligated cDNA was diluted to a
concentration suitable for Marathon RACE reactions (1:50). Adaptor-ligated
double-stranded cDNA was then ready to be specifically cloned.

First-round PCR was performed with the adaptor-specific
oligonucleotide, 5'-CCATCCTAATACGACTCACTATAGGGC-3' (AP1) (SEQ. ID
NO: 20) as sense primer and a Gene Specific Primer (GSP) from the unique
region described in Example 16 (HLMPU). The second round of PCR was
performed using nested primers GSP1-HLMPU (antisense/reverse primer)
(SEQ. ID NO: 23) and GSP2-HLMPUF (SEQ. ID NO: 24) (see Example 16;
sense/forward primer). PCR was performed using a commercial kit (Advantage
cDNA PCR core kit; Clontech Laboratories Inc., Palo Alto, CA) that utilizes
an antibody-mediated, but otherwise standard, hot-start protocol. PCR
conditions for MG63 cDNA included an initial hot-start denaturation (94°C, 60
sec) followed by: 94°C, 30 sec; 60°C, 30 sec; 68°C, 4 min; 30 cycles. The
first-round PCR product was approximately 750 base-pairs in length whereas the
nested PCR product was approximately 230 base-pairs. The first-round PCR
product was cloned into linearized pCR 2.1 vector (3.9 Kb). The inserts were
sequenced in both directions using M13 Forward and Reverse primers (SEQ.
ID NO: 11; SEQ. ID NO: 12).
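
As a small worked illustration, the cycling conditions stated above can be written down as data and their total hold time computed. The class and function names are hypothetical, and the figure excludes ramp times between temperatures, which depend on the thermal cycler.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float  # block temperature in degrees Celsius
    seconds: int   # hold time in seconds


def program_seconds(initial, cycle, n_cycles):
    """Total hold time: initial steps once, then the cycle n_cycles times."""
    once = sum(step.seconds for step in initial)
    per_cycle = sum(step.seconds for step in cycle)
    return once + n_cycles * per_cycle


# Conditions as stated in the text for MG63 cDNA.
initial = [Step(94, 60)]                             # hot-start denaturation
cycle = [Step(94, 30), Step(60, 30), Step(68, 240)]  # denature, anneal, extend
total = program_seconds(initial, cycle, n_cycles=30)
print(total, "s =", total / 60, "min")
```

Under these assumptions the program holds for 9060 seconds, or 151 minutes, most of it in the 4-minute extension step.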

EXAMPLE 22: Determination of Full-length Human LMP-1 cDNA with 5 Prime
UTR



Overlapping MG63 human osteosarcoma cell cDNA 5'-UTR sequence
(SEQ ID NO: 21), MG63 717 base-pair sequence (Example 17; SEQ ID NO: 8)
and human heart cDNA clone 7 sequence (Example 18) were aligned to derive
a novel human cDNA sequence of 1704 base-pairs (SEQ. ID NO: 22). The
alignment was accomplished with NALIGN (both PCGENE and Omiga 1.0;
Intelligenetics). Overlapping sequences constituted nearly the entire 717
base-pair region (Example 17) with 100% homology. Joining of the aligned
sequences was accomplished with SEQIN.


EXAMPLE 23: Construction of LIM Protein Expression Vector 

The construction of pHIS-5' ATG LMP-1s expression vector was carried
out with the sequences described in Examples 17 and 18. The 717 base-pair
clone (Example 17; SEQ ID NO: 7) was digested with ClaI and EcoRV. A
small fragment (~250 base-pairs) was gel purified. Clone 7 (Example 18; SEQ
ID NO: 8) was digested with ClaI and XbaI and a 1400 base-pair fragment was
gel purified. The isolated 250 base-pair and 1400 base-pair restriction
fragments were ligated to form a fragment of ~1650 base-pairs.

Due to the single nucleotide substitution in Clone 7 (relative to the 717
base-pair PCR sequence and the original rat sequence) a stop codon at
translated base-pair 672 resulted. Because of this stop codon, a truncated
(short) protein was encoded, hence the name LMP-1s. This was the construct
used in the expression vector (SEQ ID NO: 32). The full length cDNA
sequence with 5' UTR (SEQ ID NO: 33) was created by alignment of SEQ ID
NO: 32 with the 5' RACE sequence (SEQ ID NO: 21). The amino acid
sequence of LMP-1s (SEQ ID NO: 34) was then deduced as a 223 amino acid
protein and confirmed by Western blot (as in Example 15) to run at the
predicted molecular weight of ~23.7 kD.
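
The relationship between the stop codon position and the length of LMP-1s can be checked with a short calculation. The sketch assumes that "translated base-pair 672" is counted from the A of the start codon and is the last base of the stop codon; the 110 Da average residue mass is a generic rule of thumb, not a value from this specification, so the mass is only a rough estimate.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb assumption, not from the text


def protein_length_from_stop(stop_last_base: int) -> int:
    """Residues encoded when the stop codon ends at this translated base (1-based)."""
    if stop_last_base % 3 != 0:
        raise ValueError("position is not the last base of an in-frame codon")
    return stop_last_base // 3 - 1  # the stop codon itself encodes no residue


length = protein_length_from_stop(672)
approx_kda = length * AVERAGE_RESIDUE_MASS_DA / 1000
print(length, "residues, roughly", round(approx_kda, 1), "kD")
```

The result, 223 residues, agrees with the deduced length of LMP-1s, and the rough mass estimate is in the same range as the ~23.7 kD stated above.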

The pHis-ATG vector (InVitrogen, Carlsbad, CA) was digested with
EcoRV and XbaI. The vector was recovered and the 1650 base-pair restriction
fragment was then ligated into the linearized pHis-ATG. The ligated product
was cloned and amplified. The pHis-ATG-LMP-1s expression vector, also
designated pHIS-A with insert HLMP-1s, was purified by standard methods.

EXAMPLE 24: Induction of Bone Nodule Formation and Mineralization In Vitro
with LMP Expression Vector 

Rat calvarial cells were isolated and grown in secondary culture
according to Example 1. Cultures were either unstimulated or stimulated with
glucocorticoid (GC) as described in Example 1. A modification of the Superfect
Reagent (Qiagen, Valencia, CA) transfection protocol was used to transfect 3
µg/well of each vector into secondary rat calvarial osteoblast cultures according
to Example 25.

Mineralized nodules were visualized by Von Kossa staining, as described in
Example 3. Human LMP-1s gene product overexpression alone induced
bone nodule formation (~203 nodules/well) in vitro. Levels of nodules were
approximately 50% of those induced by the GC positive control (~412
nodules/well). Other positive controls included the pHisA-LMP-Rat expression
vector (~152 nodules/well) and the pCMV2/LMP-Rat-Fwd expression vector
(~206 nodules/well), whereas the negative controls included the
pCMV2/LMP-Rat-Rev expression vector (~2 nodules/well) and
untreated (NT) plates (~4 nodules/well). These data demonstrate that the
human cDNA was at least as osteoinductive as the rat cDNA. The effect was
less than that observed with GC stimulation, most likely due to suboptimal
doses of expression vector.

EXAMPLE 25: LMP-Induced Cell Differentiation In Vitro and In Vivo

The rat LMP cDNA in clone 10-4 (see Example 12) was excised from
the vector by double-digesting the clone with NotI and ApaI overnight at 37°C.
Vector pCMV2 MCS (InVitrogen, Carlsbad, CA) was digested with the same
restriction enzymes. Both the linear cDNA fragment from clone 10-4 and
pCMV2 were gel purified, extracted and ligated with T4 ligase. The ligated
DNA was gel purified, extracted and used to transform E. coli JM109 cells for
amplification. Positive agar colonies were picked, digested with NotI and ApaI
and the restriction digests were examined by gel electrophoresis. Stock
cultures were prepared of positive clones.

A reverse vector was prepared in analogous fashion except that the
restriction enzymes used were XbaI and HindIII. Because these restriction
enzymes were used, the LMP cDNA fragment from clone 10-4 was inserted
into pRc/CMV2 in the reverse (that is, non-translatable) orientation. The
recombinant vector produced is designated pCMV2/RLMP.

An appropriate volume of pCMV10-4 (60 nM final concentration is
optimal [3 µg]; for this experiment a range of 0-600 nM/well [0-30 µg/well] final
concentration is preferred) was resuspended in Minimal Eagle Media (MEM) to
450 µl final volume and vortexed for 10 seconds. Superfect was added (7.5 µl/ml
final solution), the solution was vortexed for 10 seconds and then incubated at
room temperature for 10 minutes. Following this incubation, MEM
supplemented with 10% FBS (1 ml/well; 6 ml/plate) was added and mixed by
pipetting.

The resulting solution was then promptly pipetted (1 ml/well) onto
washed ROB cultures. The cultures were incubated for 2 hours at 37°C in a
humidified atmosphere containing 5% CO2. Afterward, the cells were gently
washed once with sterile PBS and the appropriate normal incubation medium
was added.

Results demonstrated significant bone nodule formation in all rat cell
cultures which were induced with pCMV10-4. For example, pCMV10-4
transfected cells produced 429 nodules/well. Positive control cultures, which
were exposed to Trm, produced 460 nodules/well. In contrast, negative
controls, which received no treatment, produced 1 nodule/well. Similarly, when
cultures were transfected with pCMV10-4 (reverse), no nodules were
observed.

For demonstrating de novo bone formation in vivo, marrow was
aspirated from the hindlimbs of 4-5 week old normal rats (rnu/+; heterozygous
for recessive athymic condition). The aspirated marrow cells were washed in
alpha MEM, centrifuged, and RBCs were lysed by resuspending the pellet in
0.83% NH4Cl in 10 mM Tris (pH 7.4). The remaining marrow cells were
washed 3X with MEM and transfected for 2 hours with 9 µg of pCMV-LMP-1s
(forward or reverse orientation) per 3 x 10^6 cells. The transfected cells were
then washed 2X with MEM and resuspended at a concentration of 3 x 10^7
cells/ml.

The cell suspension (100 µl) was applied via sterile pipette to a sterile 2 x
5 mm type I bovine collagen disc (Sulzer Orthopaedics, Wheat Ridge, CO).
The discs were surgically implanted subcutaneously on the skull, chest,
abdomen or dorsal spine of 4-5 week old athymic rats (rnu/rnu). The animals
were sacrificed at 3-4 weeks, at which time the discs or surgical areas were
excised and fixed in 70% ethanol. The fixed specimens were analyzed by
radiography and undecalcified histologic examination was performed on 5 µm
thick sections stained with Goldner Trichrome. Experiments were also
performed using devitalized (guanidine extracted) demineralized bone matrix
(Osteotech, Shrewsbury, NJ) in place of collagen discs.

Radiography revealed a high level of mineralized bone formation that
conformed to the form of the original collagen disc containing LMP-1s
transfected marrow cells. No mineralized bone formation was observed in the
negative control (cells transfected with a reverse-oriented version of the
LMP-1s cDNA that did not code for a translated protein), and absorption of the
carrier appeared to be well underway.

Histology revealed new bone trabeculae lined with osteoblasts in the
LMP-1s transfected implants. No bone was seen along with partial resorption
of the carrier in the negative controls.

Radiography of a further experiment in which 18 sets (9 negative control
pCMV-LMP-REV & 9 experimental pCMV-LMP-1s) of implants were added to
sites alternating between lumbar and thoracic spine in athymic rats
demonstrated 0/9 negative control implants exhibiting bone formation (spine
fusion) between vertebrae. All nine of the pCMV-LMP-1s treated implants
exhibited solid bone fusions between vertebrae.

EXAMPLE 26: The Synthesis of pHIS-5' ATG LMP-1s Expression Vector from
the sequences Demonstrated in Examples 2 and 3.

The 717 base-pair clone (Example 17) was digested with ClaI and
EcoRV (New England Biologicals, city, MA). A small fragment (~250
base-pairs) was gel purified. Clone No. 7 (Example 18) was digested with ClaI and
XbaI. A 1400 base-pair fragment was gel purified from that digest. The
isolated 250 base-pair and 1400 base-pair cDNA fragments were ligated by
standard methods to form a fragment of ~1650 bp. The pHis-A vector
(InVitrogen) was digested with EcoRV and XbaI. The linearized vector was
recovered and ligated to the chimeric 1650 base-pair cDNA fragment. The
ligated product was cloned and amplified by standard methods, and the
pHis-A-5' ATG LMP-1s expression vector, also denominated as the vector pHis-A
with insert HLMP-1s, was deposited at the ATCC as previously described.

EXAMPLE 27: The Induction of Bone Nodule Formation and Mineralization In
Vitro With pHis-5' ATG LMP-1s Expression Vector

Rat calvarial cells were isolated and grown in secondary culture
according to Example 1. Cultures were either unstimulated or stimulated with
glucocorticoid (GC) according to Example 1. The cultures were transfected
with 3 µg of recombinant pHis-A vector DNA/well as described in Example 25.
Mineralized nodules were visualized by Von Kossa staining according to
Example 3.

Human LMP-1s gene product overexpression alone (i.e., without GC
stimulation) induced significant bone nodule formation (~203 nodules/well) in
vitro. This is approximately 50% of the amount of nodules produced by cells
exposed to the GC positive control (~412 nodules/well). Similar results were
obtained with cultures transfected with pHisA-LMP-Rat expression vector
(~152 nodules/well) and pCMV2/LMP-Rat-Fwd (~206 nodules/well). In
contrast, the negative control pCMV2/LMP-Rat-Rev yielded ~2 nodules/well,
while approximately 4 nodules/well were seen in the untreated plates. These
data demonstrate that the human LMP-1 cDNA was at least as osteoinductive
as the rat LMP-1 cDNA in this model system. The effect in this experiment was
less than that observed with GC stimulation, but in some experiments the
effect was comparable.
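
The comparison with the GC positive control can be reproduced from the approximate nodule counts reported above. The dictionary layout and variable names below are illustrative only; the counts are the per-well values stated in the text.

```python
# Approximate nodules/well as reported in this example.
NODULES_PER_WELL = {
    "HLMP-1s (pHis-A)": 203,
    "GC positive control": 412,
    "pHisA-LMP-Rat": 152,
    "pCMV2/LMP-Rat-Fwd": 206,
    "pCMV2/LMP-Rat-Rev": 2,
    "untreated": 4,
}

gc = NODULES_PER_WELL["GC positive control"]
percent_of_gc = {
    name: round(100.0 * count / gc, 1)
    for name, count in NODULES_PER_WELL.items()
}
for name, pct in percent_of_gc.items():
    print(f"{name}: {pct}% of GC control")
```

On these figures the human LMP-1s vector reaches about 49% of the GC response, consistent with the "approximately 50%" stated above.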






EXAMPLE 28: LMP Induces Secretion of a Soluble Osteoinductive Factor 



Overexpression of RLMP-1 or HLMP-1s in rat calvarial osteoblast
cultures as described in Example 24 resulted in significantly greater nodule
formation than was observed in the negative control. To study the mechanism
of action of LIM mineralization protein, conditioned medium was harvested at
different time points, concentrated to 10X, sterile filtered, diluted to its original
concentration in medium containing fresh serum, and applied for four days to
untransfected cells.

Conditioned media harvested from cells transfected with RLMP-1 or
HLMP-1s at day 4 was approximately as effective in inducing nodule formation
as direct overexpression of RLMP-1 in transfected cells. Conditioned media
from cells transfected with RLMP-1 or HLMP-1 in the reverse orientation had
no apparent effect on nodule formation. Nor did conditioned media harvested
from LMP-1 transfected cultures before day 4 induce nodule formation. These
data suggest that expression of LMP-1 caused the synthesis and/or secretion
of a soluble factor, which did not appear in culture medium in effective amounts
until 4 days post transfection.

Since overexpression of rLMP-1 resulted in the secretion of an
osteoinductive factor into the medium, Western blot analysis was used to
determine if LMP-1 protein was present in the medium. The presence of rLMP-1
protein was assessed using antibody specific for LMP-1 (QDPDEE) and
detected by conventional means. LMP-1 protein was found only in the cell
layer of the culture and not detected in the medium.

Partial purification of the osteoinductive soluble factor was accomplished
by standard 25% and 100% ammonium sulfate cuts followed by DE-52 anion
exchange batch chromatography (100 mM or 500 mM NaCl). All activity was
observed in the high ammonium sulfate, high NaCl fractions. Such localization
is consistent with the possibility of a single factor being responsible for
conditioning the medium.


EXAMPLE 29: Gene Therapy In Lumbar Spine Fusion Mediated by Low Dose 
Adenovirus 

This study determined the optimal dose of adenoviral delivery of the
LMP-1 cDNA (SEQ ID NO: 2) to promote spine fusion in normal, that is,
immune competent, rabbits.

A replication-deficient human recombinant adenovirus was constructed
with the LMP-1 cDNA (SEQ ID NO: 2) driven by a CMV promoter using the
Adeno-Quest™ Kit (Quantum Biotechnologies, Inc., Montreal). A commercially
available (Quantum Biotechnologies, Inc., Montreal) recombinant adenovirus
containing the beta-galactosidase gene was used as a control.

Initially, an in vitro dose response experiment was performed to
determine the optimal concentration of adenovirus-delivered LMP-1
("AdV-LMP-1") to induce bone differentiation in rat calvarial osteoblast cultures
using a 60-minute transduction with a multiplicity of infection ("MOI") of 0.025,
0.25, 2.5, or 25 plaque-forming units (pfu) of virus per cell. Positive control cultures
were differentiated by a 7-day exposure to 1 0 s M glucocorticoid ("GC"). 
Negative control cultures were left untreated. On day 14, the number of
mineralized bone nodules was counted after von Kossa staining of the
cultures, and the level of osteocalcin secreted into the medium (pmol/mL) was
measured by radioimmunoassay (mean ± SEM).

The results of this experiment are shown in Table I. Essentially no
spontaneous nodules formed in the untreated negative control cultures. The
data show that a MOI equal to 0.25 pfu/cell is most effective for osteoinducing
bone nodules, achieving a level comparable to the positive control (GC).
Lower and higher doses of adenovirus were less effective.



TABLE I

                                          AdV-LMP-1 Dose (MOI)
Outcome        Neg. Ctrl.   GC        0.025     0.25       2.5       25

Bone Nodules   0.5±0.2      188±35    79.8±13   145.1±13   26.4±15   87.6±2
Osteocalcin    1.0±0.1      57.8±9    28.6±11   22.8±1     18.3±3    26.0±2
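
The dose comparison in Table I can be expressed as a short sketch. The data structure and names are illustrative; the values are the means and SEMs from Table I, and only the means are used to rank the doses.

```python
# Table I values as (mean, SEM); keys of DOSES are MOI in pfu/cell.
CONTROLS = {
    "Neg. Ctrl.": {"nodules": (0.5, 0.2), "osteocalcin": (1.0, 0.1)},
    "GC": {"nodules": (188.0, 35.0), "osteocalcin": (57.8, 9.0)},
}
DOSES = {
    0.025: {"nodules": (79.8, 13.0), "osteocalcin": (28.6, 11.0)},
    0.25: {"nodules": (145.1, 13.0), "osteocalcin": (22.8, 1.0)},
    2.5: {"nodules": (26.4, 15.0), "osteocalcin": (18.3, 3.0)},
    25.0: {"nodules": (87.6, 2.0), "osteocalcin": (26.0, 2.0)},
}

# Dose with the highest mean nodule count.
best_moi = max(DOSES, key=lambda moi: DOSES[moi]["nodules"][0])
gc_mean = CONTROLS["GC"]["nodules"][0]
percent_of_gc = round(100.0 * DOSES[best_moi]["nodules"][0] / gc_mean)
print(best_moi, percent_of_gc)
```

Ranked by mean nodule count, the MOI of 0.25 is the most effective dose and reaches roughly three quarters of the GC value.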
In vivo experiments were then performed to determine if the optimal in
vitro dose was capable of promoting intertransverse process spine fusions in
skeletally mature New Zealand white rabbits. Nine rabbits were anesthetized
and 3 cc of bone marrow was aspirated from the distal femur through the
intercondylar notch using an 18 gauge needle. The buffy coat was then
isolated, a 10-minute transduction with AdV-LMP-1 was performed, and the
cells were returned to the operating room for implantation. Single level
posterolateral lumbar spine arthrodesis was performed with decortication of
transverse processes and insertion of carrier (either rabbit devitalized bone
matrix or a collagen sponge) containing 8-15 million autologous nucleated
buffy coat cells transduced with either AdV-LMP-1 (MOI = 0.4) or AdV-BGal
(MOI = 0.4). Rabbits were euthanized after 5 weeks and spine fusions were
assessed by manual palpation, plain x-rays, CT scans, and undecalcified
histology.
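
Because MOI is defined above as plaque-forming units of virus per cell, the amount of virus implied by a given MOI follows directly from the cell count. The sketch below applies that definition to the cell numbers stated in this paragraph; the function name is hypothetical and no allowance is made for titer uncertainty or cell viability.

```python
def pfu_required(cells: float, moi: float) -> float:
    """Plaque-forming units needed to transduce `cells` at the given MOI."""
    if cells < 0 or moi < 0:
        raise ValueError("cells and moi must be non-negative")
    return cells * moi


# 8-15 million nucleated buffy coat cells at MOI = 0.4, as stated in the text.
low = pfu_required(8e6, 0.4)
high = pfu_required(15e6, 0.4)
print(f"{low:.2e} to {high:.2e} pfu per implant")
```

Under these assumptions each implant corresponds to roughly 3.2 to 6.0 million pfu.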

The spine fusion sites that received AdV-LMP-1 developed solid,
continuous spine fusion masses in all nine rabbits. In contrast, the sites
receiving AdV-BGal, or a lower dose of AdV-LMP-1 (MOI = 0.04), made little or
no bone and resulted in spine fusion at a rate comparable to the carrier alone
(< 40%). These results were consistent as evaluated by manual palpation, CT
scan, and histology. Plain radiographs, however, sometimes overestimated
the amount of bone that was present, especially in the control sites. LMP-1
cDNA delivery and bone induction were successful with both of the carrier
materials tested. There was no evidence of systemic or local immune
response to the adenovirus vector.

These data demonstrate consistent bone induction in a previously
validated rabbit spine fusion model which is quite challenging. Furthermore,
the protocol of using autogenous bone marrow cells with intraoperative ex vivo
gene transduction (10 minutes) is a more clinically feasible procedure than
other methods that call for overnight transduction or cell expansion for weeks in
culture. In addition, the most effective dose of recombinant adenovirus
(MOI = 0.25) was substantially lower than doses reported in other gene therapy
applications (MOI 40-500). We believe this is due to the fact that LMP-1 is an
intracellular signaling molecule and may have powerful signal amplification
cascades. Moreover, the observation that the same concentration of
AdV-LMP-1 that induced bone in cell culture was effective in vivo was also
surprising given the usual required increase in dose of other growth factors
when translating from cell culture to animal experiments. Taken together,
these observations indicate that local gene therapy using adenovirus to deliver
the LMP-1 cDNA is possible and the low dose required will likely minimize the
negative effects of immune response to the adenovirus vector.


EXAMPLE 30: Use of Peripheral Venous Blood Nucleated Cells (Buffy Coat) 
for Gene Therapy With LMP-1 cDNA To Make Bone 



In four rabbits we performed spine fusion surgery as above (Example
29) except the transduced cells were the buffy coat from venous blood rather
than bone marrow. These cells were transfected with Adeno-LMP or
pHIS-LMP plasmid and had equivalent successful results as when bone marrow cells
were used. This discovery of using ordinary venous blood cells for gene
delivery makes gene therapy more feasible clinically since it avoids painful
marrow harvest under general anesthesia and yields two times more cells per
mL of starting material.

EXAMPLE 31: Isolation of Human LMP-1 Splice Variants 

Intron/Exon mRNA transcript splice variants are a relatively common
regulatory mechanism in signal transduction and cellular/tissue development.
Splice variants of various genes have been shown to alter protein-protein,
protein-DNA, protein-RNA, and protein-substrate interactions. Splice variants
may also control tissue specificity for gene expression allowing different forms
(and therefore functions) to be expressed in various tissues. Splice variants
are a common regulatory phenomenon in cells. It is possible that the LMP
splice variants may result in effects in other tissues such as nerve
regeneration, muscle regeneration, or development of other tissues.

To screen a human heart cDNA library for splice variants of the HLMP-1
sequence, a pair of PCR primers corresponding to sections of SEQ ID NO: 22
was prepared. The forward PCR primer, which was synthesized using
standard techniques, corresponds to nucleotides 35-54 of SEQ ID NO: 22. It
has the following sequence:






5' GAGCCGGCATCATGGATTCC 3' (SEQ ID NO: 35) 

The reverse PCR primer, which is the reverse complement of 
nucleotides 820-839 in SEQ ID NO: 22, has the following sequence: 



5' GCTGCCTGCACAATGGAGGT 3' (SEQ ID NO: 36)
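
The relationship between the two primers and their template can be illustrated with a short sketch. The helper names are hypothetical. As a stand-in template, the sketch uses the first 60 nucleotides of SEQ ID NO: 37 as printed later in this example, on the assumption that this 5' region is shared with SEQ ID NO: 22; only exact matches are considered.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def primer_site(template: str, primer: str):
    """1-based (start, end) of the first exact match in template, or None."""
    idx = template.upper().find(primer.upper())
    return None if idx < 0 else (idx + 1, idx + len(primer))


FORWARD = "GAGCCGGCATCATGGATTCC"   # SEQ ID NO: 35
REVERSE = "GCTGCCTGCACAATGGAGGT"   # SEQ ID NO: 36
# Stand-in template: first 60 nt of SEQ ID NO: 37 (assumed shared 5' region).
TEMPLATE_5PRIME = "CGACGCAGAGCAGCGCCCTGGCCGGGCCAAGCAGGAGCCGGCATCATGGATTCCTTCAAG"

print(primer_site(TEMPLATE_5PRIME, FORWARD))
print(reverse_complement(REVERSE))
```

The forward primer matches positions 35-54 of the stand-in template, in agreement with the coordinates given above. The reverse primer anneals to the opposite strand, so it is its reverse complement that appears in the sense-strand sequence.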


The forward and reverse PCR primers were used to screen human heart
cDNA (Clontech, Cat No. 7404-1) for sequences similar to HLMP-1 by
standard techniques, using a cycling protocol of 94°C for 30 seconds, 64°C for
30 seconds, and 72°C for 1 minute, repeated 30 times and followed by a 10
minute incubation at 72°C. The amplified cDNA sequences were
gel-purified and submitted to the Emory DNA Sequence Core Facility for
sequencing. The clones were sequenced using standard techniques and the
sequences were examined with PCGENE (Intelligenetics; Programs SEQIN
and NALIGN) to determine homology to SEQ ID NO: 22. Two homologous
nucleotide sequences with putative alternative splice sites compared to SEQ ID
NO: 22 were then translated to their respective protein products with
Intelligenetics' program TRANSL.

One of these two novel human cDNA sequences (SEQ ID NO: 37) 
comprises 1456 bp: 


10 20 30 40 50 60 
CGACGCAGAG CAGCGCCCTG GCCGGGCCAA GCAGGAGCCG GCATCATGGA TTCCTTCAAG
70 80 90 100 110 120 

GTAGTGCTGG AGGGGCCAGC ACCTTGGGGC TTCCGGCTGC AAGGGGGCAA GGACTTCAAT 
130 140 150 160 170 180 

GTGCCCCTCT CCATTTCCCG GCTCACTCCT GGGGG CAAAG CGGCGCAGGC CGGAGTGGCC

190 200 210 220 230 240 

GTGGGTGACT GGGTGCTGAG CATCGATGGC GAGAATGCGG GTAGCCTCAC ACACATCGAA 
250 260 270 280 290 300 

GCTCAGAACA AGATCCGGGC CTGCGGGGAG CGCCTCAGCC TGGGCCTCAG CAGGGCCCAG 
310 320 * 330 340 * 350 360

CCGGTTCAGA GCAAACCGCA GAAG GTGCAG ACCCCTGACA A ACAGCCGCT CCGACCGCTG
370 380 390 400 410 420

GTCCCAGATG CCAGCAAGCA GCGGCTGATG GAGAACACAG AGGACTGGCG GCCGCGGCCG 
430 440 450 460 470 480 

GGGACAGGCC AGTCGCGTTC CTTCCGCATC CTTGCCCACC TCACAGGCAC CGAGTTCATG

490 500 510 520 530 540 

CAAGACCCGG ATGAGGAGCA CCTGAAGAAA TCAAGCCAGG TG CCCAGG AC AGAAGCCCCA 
550 560 570 580 590 600 

GCCCCAGCCT CATCTACACC CCAGGAGCCC TGGCCTGGCC CTACCGCCCC CAGCCCTACC 
610 620 630 640 650 660

AGCCGCCCGC CCTGGGCTGT GGACCCTGCG TTTGCCGAGC GCTATGCCCC GGACAAAACG 
670 680 690 700 710 720 

AGCACAGTGC TGACCCGGCA CAGCCAGCCG GCCACGCCCA CGCCGCTGCA GAGCCGCACC 
730 740 750 760 770 780 

TCCATTGTGC AGGCAGCTGC CGGAGGGGTG CCAGGAGGGG GCAGCAACAA CGGCAAGACT

790 800 810 820 830 840 

CCCGTGTGTC ACCAGTGCCA CAAGGTCATC CGGGGCCGCT ACCTGGTGGC GTTGGGCCAC 
850 860 870 880 890 900 
GCGTACCACC CGGAGGAGTT TGTGTGTAGC CAGTGTGGGA AGGTCCTGGA AGAGGGTGGC
910 920 930 940 950 960 

TTCTTTGAGG AGAAGGGCGC CATCTTCTGC CCACCATGCT ATGACGTGCG CTATGCACCC 
970 980 990 1000 1010 1020 

AGCTGTGCCA AGTGCAAGAA GAAGATTACA GGCGAGATCA TGCACGCCCT GAAGATGACC

1030 1040 1050 1060 1070 1080 

TGGCACGTGC ACTG CTTTAC CTGTGCTGCC TGCAAGACGC CCATCCGGAA CAGGGCCTTC 
1090 1100 1110 1120 1130 1140 

TACATGGAGG AGGGCGTGCC CTATTGCGAG CGAGACTATG AGAAGATGTT TGGCACGAAA 
1150 1160 1170 1180 1190 1200

TGCCATGGCT GTGACTTCAA GATCGACGCT GGGGACCGCT TCCTGGAGGC CCTGGGCTTC 
1210 1220 1230 1240 1250 1260 

AGCTGGCATG ACACCTGCTT CGTCTGTGCG ATATGTCAGA TCAACCTGGA AGGAAAGACC 
1270 1280 1290 1300 1310 1320 

TTCTACTCCA AGAAGGACAG GCCTCTCTGC AAGAGCCATG CCTTCTCTCA TGTGTGAGCC

1330 1340 1350 1360 1370 1380 

CCTTCTGCCC ACAGCTGCCG CGGTGGCCCC TAGCCTGAGG GGCCTGGAGT CGTGGCCCTG 
1390 1400 1410 1420 1430 1440 

CATTTCTGGG TAGGGCTGGC AATGGTTGCC TTAACCCTGG CTCCTGGCCC GAGCCTGGGC 
1450

TCCCGGGCCC TGCCCA 

Reading frame shifts caused by the deletion of a 119 bp fragment
(between *) and the addition of a 17 bp fragment (underlined) result in a
truncated gene product having the following derived amino acid sequence
(SEQ ID NO: 38):

10 20 30 40 50 60
MDSFKVVLEG PAPWGFRLQG GKDFNVPLSI SRLTPGGKAA QAGVAVGDWV LSIDGENAGS

70 80 90 100 110 120
LTHIEAQNKI RACGERLSLG LSRAQPVQNK PQKVQTPDKQ PLRPLVPDAS KQRLMENTED

130 140 150 160 170 180
WRPRPGTGQS RSFRILAHLT GTEFMQDPDE EHLKKSSQVP RTEAPAPASS TPQEPWPGPT

190 200 210 220 230 240
APSPTSRPPW AVDPAFAERY APDKTSTVLT RHSQPATPTP LQSRTSIVQA AAGGVPGGGS

250 260 270 280 290 300
NNGKTPVCHQ CHQVIRARYL VALGHAYHPE EFVCSQCGKV LEEGGFFEEK GAIFCPPCYD

310 320 330 340 350 360
VRYAPSCAKC KKKITGEIMH ALKMTWHVLC FTCAACKTPI RNRAFYMEEG VPYCERDYEK

370 380 390 400 410 420
MFGTKCQWCD FKIDAGDRFL EALGFSWHDT CFVCAICQIN LEGKTFYSKK DRPLCKSHAF

SHV

This 423 amino acid protein demonstrates 100% homology to the
protein shown in Sequence ID No. 10, except for the sequence in the
highlighted area (amino acids 94-99), which is due to the nucleotide changes
depicted above.
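
The stated length of this variant is consistent with simple frame arithmetic, sketched below. The function name is hypothetical. The sketch assumes that both changes lie within the coding region and uses the 457 amino acid length of full-length HLMP-1 from Example 20.

```python
def frame_effect(deleted_bp: int, inserted_bp: int):
    """Net length change in bp and whether the downstream frame is preserved."""
    net = inserted_bp - deleted_bp
    return net, net % 3 == 0


FULL_LENGTH_AA = 457  # HLMP-1, SEQ ID NO: 10 (Example 20)

net_bp, frame_preserved = frame_effect(deleted_bp=119, inserted_bp=17)
codon_change = net_bp // 3              # whole codons gained (+) or lost (-)
variant_aa = FULL_LENGTH_AA + codon_change
print(net_bp, frame_preserved, variant_aa)
```

A 119 bp deletion alone, or a 17 bp insertion alone, would shift the reading frame, since neither is a multiple of three. Together they remove a net 102 bp, which is exactly 34 codons, so the downstream frame is restored and the expected product is 457 - 34 = 423 amino acids.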

The second novel human heart cDNA sequence (SEQ ID NO: 39)
comprises 1575 bp:

10 20 30 40 50 60 

CGACGCAGAG CAGCGCCCTG GCCGGGCCAA GCAGGAGCCG GCATCATGGA TTCCTTCAAG 
70 80 90 100 110 120 

GTAGTGCTGG AGGGGCCAGC ACCTTGGGGC TTCCGGCTGC AAGGGGGCAA GGACTTCAAT

130 140 150 160 170 180 

GTGCCCCTCT CCATTTCCCG GCTCACTCCT GGGGGCAAAG CGGCGCAGGC CGGAGTGGCC 
190 200 210 220 230 240 
GTGGGTGACT GGGTGCTGAG CATCGATGGC GAGAATGCGG GTAGCCTCAC ACACATCGAA
250 260 270 280 290 300 

GCTCAGAACA AGATCCGGGC CTGCGGGGAG CGCCTCAGCC TGGGCCTCAG CAGGGCCCAG 
310 320 330 340 350 360 

CCGGTTCAGA GCAAACCGCA GAAGGCCTCC GCCCCCGCCG CGGACCCTCC GCGGTACACC

370 380 390 400 410 420 

TTTGCACCCA GCGTCTCCCT CAACAAGACG GCCCGGCCCT TTGGGGCGCC CCCGCCCGCT 
430 440 450 460 470 480 

GACAGCGCCC CGCAACAGAA TGG GTGCAGA CCCCTGACAA ACAGCCGCTC CGACCGCTGG 
490 500 510 520 530 540

TCCCAGATGC CAGCAAGCAG CGGCTGATGG AGAACACAGA GGACTGGCGG CCGCGGCCGG
550 560 570 580 590 600

GGACAGGCCA GTCGCGTTCC TTCCGCATCC TTGCCCACCT CACAGGCACC GAGTTCATGC 
610 620 630 640 650 660 

15 AAGACCCGGA TGAGGAGCAC CTGAAGAAAT CAAGCCAGGT GCCCAGGACA GAAGCCCCAG 

670 680 690 700 710 720 

CCCCAGCCTC ATCTACACCC CAGGAGCCCT GGCCTGGCCC TACCGCCCCC AGCCCTACCA 
730 740 750 760 770 780 

GCCGCCCGCC CTGGGCTGTG GACCCTGCGT TTGCCGAGCG CTATGCCCCG GACAAAACGA 
790 800 810 820 830 840

GCACAGTGCT GACCCGGCAC AGCCAGCCGG CCACGCCCAC GCCGCTGCAG AGCCGCACCT 
850 860 870 880 890 900 

CCATTGTGCA GGCAGCTGCC GGAGGGGTGC CAGGAGGGGG CAGCAACAAC GGCAAGACTC 
910 920 930 940 950 960 

CCGTGTGTCA CCAGTGCCAC AAGGTCATCC GGGGCCGCTA CCTGGTGGCG TTGGGCCACG

970 980 990 1000 1010 1020 

CGTACCACCC GGAGGAGTTT GTGTGTAGCC AGTGTGGGAA GGTCCTGGAA GAGGGTGGCT 
1030 1040 1050 1060 1070 1080 
TCTTTGAGGA GAAGGGCGCC ATCTTCTGCC CACCATGCTA TGACGTGCGC TATGCACCCA
1090 1100 1110 1120 1130 1140 

GCTGTGCCAA GTGCAAGAAG AAGATTACAG GCGAGATCAT GCACGCCCTG AAGATGACCT 
1150 1160 1170 1180 1190 1200 

GGCACGTGCA CTGCTTTACC TGTGCTGCCT GCAAGACGCC CATCCGGAAC AGGGCCTTCT

1210 1220 1230 1240 1250 1260 

ACATGGAGGA GGGCGTGCCC TATTGCGAGC GAGACTATGA GAAGATGTTT GGCACGAAAT 
1270 1280 1290 1300 1310 1320 

GCCATGGCTG TGACTTCAAG ATCGACGCTG GGGACCGCTT CCTGGAGGCC CTGGGCTTCA 
1330 1340 1350 1360 1370 1380

GCTGGCATGA CACCTGCTTC GTCTGTGCGA TATGTCAGAT CAACCTGGAA GGAAAGACCT 
1390 1400 1410 1420 1430 1440

TCTACTCCAA GAAGGACAGG CCTCTCTGCA AGAGCCATGC CTTCTCTCAT GTGTGAGCCC
1450 1460 1470 1480 1490 1500

CTTCTGCCCA CAGCTGCCGC GGTGGCCCCT AGCCTGAGGG GCCTGGAGTC GTGGCCCTGC

1510 1520 1530 1540 1550 1560 

ATTTCTGGGT AGGGCTGGCA ATGGTTGCCT TAACCCTGGC TCCTGGCCCG AGCCTGGGCT 
1570 

CCCGGGCCCT GCCCA 

A reading frame shift caused by the addition of a 17 bp fragment
(bolded, italicized and underlined) results in an early translation stop codon at
position 505-507 (underlined).

The derived amino acid sequence (SEQ ID NO: 40) consists of 153 
amino acids: 






10 20 30 40 50 60
MDSFKVVLEG PAPWGFRLQG GKDFNVPLSI SRLTPGGKAA QAGVAVGDWV LSIDGENAGS

70 80 90 100 110 120
LTHIEAQNKI RACGERLSLG LSRAQPVQSK PQKASAPAAD PPRYTFAPSV SLNKTARPFG

130 140 150
APPPADSAPQ QNGCRPLTNS RSDRWSQMPA SSG

This protein demonstrates 100% homology to SEQ ID NO: 10 until
amino acid 94, where the addition of the 17 bp fragment depicted in the
nucleotide sequence results in a frame shift. Over amino acids 94-153, the
protein is not homologous to SEQ ID NO: 10. Amino acids 154-457 in SEQ ID
NO: 10 are not present due to the early stop codon depicted in the nucleotide
sequence.






EXAMPLE 32: Genomic HLMP-1 Nucleotide Sequence 

Applicants have identified the genomic DNA sequence encoding HLMP-1,
including putative regulatory elements associated with HLMP-1 expression.
The entire genomic sequence is shown in SEQ ID NO: 41. This sequence
was derived from AC023788 (clone RP11-564G9), Genome Sequencing
Center, Washington University School of Medicine, St. Louis, MO.

The putative promoter region for HLMP-1 spans nucleotides 2,660-8,733
in SEQ ID NO: 41. This region comprises, among other things, at least
ten potential glucocorticoid response elements ("GREs") (nucleotides 6148-6153,
6226-6231, 6247-6252, 6336-6341, 6510-6515, 6552-6557, 6727-6732,
6752-6757, 7738-7743, and 8255-8260), twelve potential Sma-2 homologues
to Mothers against Drosophila decapentaplegic ("SMAD") binding element
sites (nucleotides 3569-3575, 4552-4558, 4582-4588, 5226-5232, 6228-6234,
6649-6655, 6725-6731, 6930-6936, 7379-7384, 7738-7742, 8073-8079, and
8378-8384), and three TATA boxes (nucleotides 5910-5913, 6932-6935, and
7380-7383). The three TATA boxes, all of the GREs, and eight of the SMAD
binding elements ("SBEs") are grouped in the region spanning nucleotides
5,841-8,733 in SEQ ID NO: 41. These regulatory elements can be used, for
example, to regulate expression of exogenous nucleotide sequences encoding
proteins involved in the process of bone formation. This would permit systemic
administration of therapeutic factors or genes relating to bone formation and
repair, as well as factors or genes associated with tissue differentiation and
development.
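
The grouping of elements described above can be checked with simple interval arithmetic. The following is a minimal Python sketch that transcribes the coordinates from the paragraph and counts how many elements of each type fall inside the 5,841-8,733 region. The variable and function names are illustrative assumptions, and the third SMAD site is read as 4582-4588.

```python
# (start, end) coordinates in SEQ ID NO: 41, transcribed from the text.
GRES = [(6148, 6153), (6226, 6231), (6247, 6252), (6336, 6341), (6510, 6515),
        (6552, 6557), (6727, 6732), (6752, 6757), (7738, 7743), (8255, 8260)]
# The third SMAD site is read as 4582-4588 (an assumption about the source).
SBES = [(3569, 3575), (4552, 4558), (4582, 4588), (5226, 5232), (6228, 6234),
        (6649, 6655), (6725, 6731), (6930, 6936), (7379, 7384), (7738, 7742),
        (8073, 8079), (8378, 8384)]
TATA_BOXES = [(5910, 5913), (6932, 6935), (7380, 7383)]

# Region in which the text says the elements are grouped.
CLUSTER = (5841, 8733)


def elements_within(elements, region):
    """Return the elements lying entirely inside the region (inclusive)."""
    lo, hi = region
    return [(start, end) for start, end in elements if lo <= start and end <= hi]


# For each element type: (total listed, number inside the cluster region).
summary = {
    name: (len(sites), len(elements_within(sites, CLUSTER)))
    for name, sites in (("GRE", GRES), ("SBE", SBES), ("TATA", TATA_BOXES))
}

for name, (total, clustered) in summary.items():
    print(f"{name}: {clustered} of {total} within {CLUSTER[0]}-{CLUSTER[1]}")
```

Run as written, the counts agree with the text: all ten GREs, eight of the twelve SBEs, and all three TATA boxes lie within the region.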

In addition to the putative regulatory elements, 13 exons corresponding
to the nucleotide sequence encoding HLMP-1 have been identified. These
exons span the following nucleotides in SEQ ID NO: 41:



Exon 1      8733-8767
Exon 2      9790-9895
Exon 3      13635-13787
Exon 4      [illegible in source]
Exon 5      [illegible in source]
Exon 6      [illegible in source]
Exon 7      [illegible in source]
Exon 8      [illegible in source]
Exon 9      [illegible in source]
Exon 10     18068-18248
Exon 11     22117-22240
Exon 12     22323-22440
Exon 13     22575-22911



In HLMP-2 there is another exon (Exon 5A), which spans nucleotides 
14887-14904. 
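
Because the exon coordinates are inclusive, the length of each exon is its end minus its start plus one. The following is a minimal Python sketch of that arithmetic, restricted to the exons whose coordinates are legible in this copy of the specification. The dictionary layout and names are illustrative assumptions, and the end of Exon 13 is read as 22911.

```python
# Inclusive (start, end) coordinates in SEQ ID NO: 41 for the exons whose
# coordinates are legible; Exons 4-9 are omitted rather than guessed.
EXONS = {
    "Exon 1": (8733, 8767),
    "Exon 2": (9790, 9895),
    "Exon 3": (13635, 13787),
    "Exon 5A": (14887, 14904),   # exon described for HLMP-2
    "Exon 10": (18068, 18248),
    "Exon 11": (22117, 22240),
    "Exon 12": (22323, 22440),
    "Exon 13": (22575, 22911),   # end coordinate read as 22911 (assumption)
}


def exon_length(start: int, end: int) -> int:
    """Length in nucleotides of an exon given inclusive coordinates."""
    return end - start + 1


lengths = {name: exon_length(start, end) for name, (start, end) in EXONS.items()}

for name, length in lengths.items():
    print(f"{name}: {length} nt")
```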

All cited publications and patents are hereby incorporated by reference 
in their entirety. 

While the foregoing specification teaches the principles of the present
invention, with examples provided for the purpose of illustration, it will be
appreciated by one skilled in the art from reading this disclosure that various
changes in form and detail can be made without departing from the true scope
of the invention.

We claim:

1. An isolated nucleic acid molecule comprising SEQ ID NO: 37 or
SEQ ID NO: 39.

2. An isolated human LMP protein encoded by SEQ ID NO: 37 or SEQ
ID NO: 39.

3. A vector comprising the isolated nucleic acid molecule of claim 1.

4. A host cell comprising the vector of claim 3.

5. The host cell of claim 4, wherein the host cell is selected from the
group consisting of prokaryotic cells, yeast cells and mammalian cells.

6. The isolated nucleic acid molecule of claim 1, further comprising a
label.

7. A human LIM mineralization protein comprising an amino acid
sequence selected from the group consisting of SEQ ID NO: 38 and SEQ ID
NO: 40.

8. A monoclonal antibody specific for HLMP-2 (SEQ ID NO: 38) or
HLMP-3 (SEQ ID NO: 40).

9. A method of inducing bone formation comprising transfecting
osteogenic precursor cells or peripheral blood leukocytes with an isolated
nucleic acid molecule comprising SEQ ID NO: 39.

10. The method of claim 9, wherein the isolated nucleic acid molecule is
in a vector.

11. The method of claim 10, wherein the vector is an expression vector.

12. The method of claim 11, wherein the vector is a plasmid.

13. The method of claim 11, wherein the vector is a virus.

14. The method of claim 13, wherein the virus is an adenovirus.

15. The method of claim 13, wherein the virus is a retrovirus.

16. The method of claim 9, wherein the osteogenic precursor cells or
peripheral blood leukocytes are transfected ex vivo.

17. The method of claim 9, wherein the osteogenic precursor cells are
transfected in vivo by direct injection of the isolated nucleic acid molecule.

18. The method of claim 9, wherein the LIM mineralization protein is
HLMP-3 (SEQ ID NO: 40).

19. A method of fusing a spine, comprising:

(a) transfecting osteogenic precursor cells or peripheral blood
leukocytes with an isolated nucleic acid molecule comprising SEQ ID NO: 39;

(b) admixing the transfected osteogenic precursor cells or peripheral
blood leukocytes with a matrix; and

(c) contacting the matrix with the spine;

wherein expression of the nucleotide sequence causes mineralized
bone formation in the matrix.

20. The method of claim 19, wherein the osteogenic precursor cells or
peripheral blood cells are transfected ex vivo.

21. A method of inducing systemic bone formation in a mammalian host
in need thereof, comprising:

(a) transfecting osteogenic precursor cells or peripheral blood
leukocytes with a vector that is stably maintained in the osteogenic precursor
cells or peripheral blood leukocytes, the vector comprising SEQ ID NO: 39
operably linked to a regulatable promoter, wherein the regulatable promoter
responds to an exogenous control compound; and

(b) administering to the host, as needed, an amount of the
exogenous control substance effective to cause expression of SEQ ID NO: 39.

22. A method of stimulating production of an osteogenic soluble factor
by an osteogenic cell, comprising:

(a) transfecting the osteogenic cell or peripheral blood leukocyte with
an isolated nucleic acid molecule comprising SEQ ID NO: 39; and

(b) overexpressing the isolated nucleic acid molecule.

23. An osteogenic soluble factor produced by the method of claim 22.

24. The osteogenic soluble factor of claim 23, wherein the osteogenic
factor is a protein.

25. A method of inhibiting the expression of HLMP-2 or HLMP-3
comprising transfecting a cell wherein HLMP-2 or HLMP-3 is expressed with an
antisense oligonucleotide.

26. The method of claim 17, wherein the isolated nucleic acid
molecule is in a vector selected from the group consisting of a plasmid and a
virus.

27. The method of claim 26, wherein the vector is a plasmid, which
plasmid is directly injected into muscle tissue.

28. A method of inhibiting bone formation comprising transfecting
osteogenic precursor cells or peripheral blood leukocytes with an isolated
nucleic acid molecule comprising SEQ ID NO: 37.

29. The method of claim 28, wherein the isolated nucleic acid molecule
is in a vector.

30. The method of claim 29, wherein the vector is an expression vector.

31. The method of claim 30, wherein the vector is a plasmid.

32. The method of claim 30, wherein the vector is a virus.

33. The method of claim 32, wherein the virus is an adenovirus.

34. The method of claim 32, wherein the virus is a retrovirus.

35. The method of claim 28, wherein the osteogenic precursor cells or
peripheral blood leukocytes are transfected ex vivo.

36. The method of claim 28, wherein the osteogenic precursor cells are
transfected in vivo by direct injection of the isolated nucleic acid molecule.

37. The method of claim 28, wherein the LIM mineralization protein is
HLMP-2 (SEQ ID NO: 38).

SEQUENCE LISTING



<110> Boden M.D., Scott D

Hair Ph.D., Gregory A

<120> LIM Mineralization Protein Splice Variants 

<130> splice variant LMP 

<140> 6148.0222-00304 
<141> 2000-04-28 

<150> 60/132,021 
<151> 1999-04-30 

<160> 42 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 457 

<212> PRT

<213> Rattus norvegicus 

<400> 1 

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

Leu Ser Arg Ala Gln Pro Ala Gln Ser Lys Pro Gln Lys Ala Leu Thr
85 90 95

Pro Pro Ala Asp Pro Pro Arg Tyr Thr Phe Ala Pro Ser Ala Ser Leu
100 105 110

Asn Lys Thr Ala Arg Pro Phe Gly Ala Pro Pro Pro Thr Asp Ser Ala
115 120 125

Leu Ser Gln Asn Gly Gln Leu Leu Arg Gln Leu Val Pro Asp Ala Ser
130 135 140

Lys Gln Arg Leu Met Glu Asn Thr Glu Asp Trp Arg Pro Arg Pro Gly
145 150 155 160

Thr Gly Gln Ser Arg Ser Phe Arg Ile Leu Ala His Leu Thr Gly Thr
165 170 175

Glu Phe Met Gln Asp Pro Asp Glu Glu Phe Met Lys Lys Ser Ser Gln
180 185 190

Val Pro Arg Thr Glu Ala Pro Ala Pro Ala Ser Thr Ile Pro Gln Glu
195 200 205

Ser Trp Pro Gly Pro Thr Thr Pro Ser Pro Thr Ser Arg Pro Pro Trp
210 215 220

Ala Val Asp Pro Ala Phe Ala Glu Arg Tyr Ala Pro Asp Lys Thr Ser
225 230 235 240

Thr Val Leu Thr Arg His Ser Gln Pro Ala Thr Pro Thr Pro Leu Gln
245 250 255

Asn Arg Thr Ser Ile Val Gln Ala Ala Ala Gly Gly Gly Thr Gly Gly
260 265 270

Gly Ser Asn Asn Gly Lys Thr Pro Val Cys His Gln Cys His Lys Ile
275 280 285

Ile Arg Gly Arg Tyr Leu Val Ala Leu Gly His Ala Tyr His Pro Glu
290 295 300

Glu Phe Val Cys Ser Gln Cys Gly Lys Val Leu Glu Glu Gly Gly Phe
305 310 315 320

Phe Glu Glu Lys Gly Ala Ile Phe Cys Pro Ser Cys Tyr Asp Val Arg
325 330 335

Tyr Ala Pro Ser Cys Ala Lys Cys Lys Lys Lys Ile Thr Gly Glu Ile
340 345 350

Met His Ala Leu Lys Met Thr Trp His Val Pro Cys Phe Thr Cys Ala
355 360 365

Ala Cys Lys Thr Pro Ile Arg Asn Arg Ala Phe Tyr Met Glu Glu Gly
370 375 380

Ala Pro Tyr Cys Glu Arg Asp Tyr Glu Lys Met Phe Gly Thr Lys Cys
385 390 395 400

Arg Gly Cys Asp Phe Lys Ile Asp Ala Gly Asp Arg Phe Leu Glu Ala
405 410 415

Leu Gly Phe Ser Trp His Asp Thr Cys Phe Val Cys Ala Ile Cys Gln
420 425 430

Ile Asn Leu Glu Gly Lys Thr Phe Tyr Ser Lys Lys Asp Lys Pro Leu
435 440 445

Cys Lys Ser His Ala Phe Ser His Val
450 455



<210> 2
<211> 1696 
<212> DNA 

<213> Rattus norvegicus 
<400> 2 

gcacgaggat cccagcgcgg ctcctggagg ccgccaggca gccgcccagc cgggcattca 60 

ggagcaggta ccatggattc cttcaaggta gtgctggagg gacctgcccc ttggggcttc 120

cgtctgcaag ggggcaagga cttcaacgtg cccctctcca tctctcggct cactcctgga 180 

ggcaaggccg cacaggccgg tgtggccgtg ggagactggg tactgagtat cgacggtgag 240

aacgccggaa gcctcacaca cattgaagcc cagaacaaga tccgtgcctg tggggagcgc 300 

ctcagcctgg gtcttagcag agcccagcct gctcagagca aaccacagaa ggccctgacc 360 

cctcccgccg accccccgag gtacactttt gcaccaagcg cctccctcaa caagacggcc 420 

cggcccttcg gggcaccccc acctactgac agcgccctgt cgcagaatgg acagctgctc 480 

agacagctgg tccctgatgc cagcaagcag cggctgatgg agaatactga agactggcgc 540

ccgcggccag ggacaggcca gtcccgttcc ttccgcatcc ttgctcacct cacgggcaca 600 

gagttcatgc aagacccgga tgaggaattc atgaagaagt caagccaggt gcccaggaca 660 

gaagccccag ccccagcctc aaccataccc caggaatcct ggcctggccc caccaccccc 720 

agccccacca gccgcccacc ctgggccgta gatcctgcat ttgctgagcg ctatgcccca 780

gacaaaacca gcacagtgct gacccgacac agccagccag ccacacctac gcctctgcag 840 

aaccgcacct ccatagttca ggctgcagct ggagggggca caggaggagg cagcaacaat 900 

ggcaagacgc ctgtatgcca ccagtgccac aagatcatcc gcggccgata cctggtagca 960

ctgggccacg cgtaccatcc tgaggaattt gtgtgcagcc agtgtgggaa ggtcctggaa 1020 

gagggtggct tcttcgagga gaagggagct atcttttgcc cctcctgcta tgatgtgcgc 1080 

tatgcaccca gctgtgccaa atgcaagaag aagatcactg gagagatcat gcatgcgctg 1140

aagatgacct ggcatgttcc ctgcttcacc tgtgcagcct gcaaaacccc tatccgcaac 1200 

agggctttct acatggagga gggggctccc tactgcgagc gagattacga gaagatgttt 1260 

ggcacaaagt gtcgcggctg tgacttcaag atcgatgccg gggaccgttt cctggaagcc 1320 

ctgggtttca gctggcatga tacgtgtttt gtttgcgcaa tatgtcaaat caacttggaa 1380 

ggaaagacct tctactccaa gaaggacaag cccctgtgca agagccatgc cttttcccac 1440
gtatgagcac ctcctcacac tactgccacc ctactctgcc agaagggtga taaaatgaga 1500

gagctctctc tccctcgacc tttctgggtg gggctggcag ccattgtcct agccttggct 1560

cctggccaga tcctggggct ccctcctcac agtccccttt cccacacttc ctccaccacc 1620
accaccgtca ctcacaggtg ctagcctcct agccccagtt cactctggtg tcacaataaa 1680 
cctgtatgta gctgtg 1696 



<210> 3 
<211> 260 
<212> DNA 

<213> Rattus norvegicus 
<400> 3 

ttctacatgg aggagggggc tccctactgc gagcgagatt acgagaagat gtttggcaca 60
aagtgtcgcg gctgtgactt caagatcgat gccggggacc gtttcctgga agccctgggt 120
ttcagctggc atgatacgtg ttttgtttgc gcaatatgtc aaatcaactt ggaaggaaag 180
accttctact ccaagaagga caagcccctg tgcaagagcc atgccttttc ccacgtatga 240
gcacctcctc acactactgc 260



<210> 4
<211> 16
<212> DNA
<213> MMLV

<400> 4

aagctttttt tttttg 16



<210> 5 
<211> 13 
<212> DNA 
<213> MMLV 

<400> 5 

aagcttggct atg 13 



<210> 6 

<211> 223 

<212> DNA 

<213> Homo sapiens 

<400> 6 

atccttgctc acctcacggg caccgagttc atgcaagacc cggatgagga gcacctgaag 60 

aaatcaagcc aggtgcccag gacagaagcc ccagccccag cctcatctac accccaggag 120

ccctggcctg gccctaccgc ccccagccct accagccgcc cgccctgggc tgtggaccct 180

gcgtttgccg agcgctatgc cccagacaaa accagcacag tgc 223



<210> 7 

<211> 717 

<212> DNA 

<213> Homo sapiens 

<400> 7 

atggattcct tcaaggtagt gctggagggg ccagcacctt ggggcttccg gctgcaaggg 60
ggcaaggact tcaatgtgcc cctctccatt tcccggctca ctcctggggg caaagcggcg 120
caggccggag tggccgtggg tgactgggtg ctgagcatcg atggcgagaa tgcgggtagc 180
ctcacacaca tcgaagctca gaacaagatc cgggcctgcg gggagcgcct cagcctgggc 240
ctcagcaggg cccagccggt tcagagcaaa ccgcagaagg cctccgcccc cgccgcggac 300
cctccgcggt acacctttgc acccagcgtc tccctcaaca agacggcccg gccctttggg 360
gcgcccccgc ccgctgacag cgccccgcaa cagaatggac agccgctccg accgctggtc 420
ccagatgcca gcaagcagcg gctgatggag aacacagagg actggcggcc gcggccgggg 480
acaggccagt cgcgttcctt ccgcatcctt gcccacctca caggcaccga gttcatgcaa 540
gacccggatg aggagcacct gaagaaatca agccaggtgc ccaggacaga agccccagcc 600
ccagcctcat ctacacccca ggagccctgg cctggcccta ccgcccccag ccctaccagc 660
cgcccgccct gggctgtgga ccctgcgttt gccgagcgct atgccccgga caaaacg 717



<210> 8 

<211> 1488 

<212> DNA 

<213> Homo sapiens 

<400> 8 

atcgatggcg agaatgcggg tagcctcaca cacatcgaag ctcagaacaa gatccgggcc 60 

tgcggggagc gcctcagcct gggcctcagc agggcccagc cggttcagag caaaccgcag 120 

aaggcctccg cccccgccgc ggaccctccg cggtacacct ttgcacccag cgtctccctc 180 

aacaagacgg cccggccctt tggggcgccc ccgcccgctg acagcgcccc gcaacagaat 240

ggacagccgc tccgaccgct ggtcccagat gccagcaagc agcggctgat ggagaacaca 300

gaggactggc ggccgcggcc ggggacaggc cagtcgcgtt ccttccgcat ccttgcccac 360

ctcacaggca ccgagttcat gcaagacccg gatgaggagc acctgaagaa atcaagccag 420

gtgcccagga cagaagcccc agccccagcc tcatctacac cccaggagcc ctggcctggc 480

cctaccgccc ccagccctac cagccgcccg ccctgagctg tggaccctgc gtttgccgag 540

cgctatgccc cggacaaaac gagcacagtg ctgacccggc acagccagcc ggccacgccc 600

acgccgctgc agagccgcac ctccattgtg caggcagctg ccggaggggt gccaggaggg 660

ggcagcaaca acggcaagac tcccgtgtgt caccagtgcc acaaggtcat ccggggccgc 720

tacctggtgg cgttgggcca cgcgtaccac ccggaggagt ttgtgtgtag ccagtgtggg 780

aaggtcctgg aagagggtgg cttctttgag gagaagggcg ccatcttctg cccaccatgc 840

tatgacgtgc gctatgcacc cagctgtgcc aagtgcaaga agaagattac aggcgagatc 900

atgcacgccc tgaagatgac ctggcacgtg cactgcttta cctgtgctgc ctgcaagacg 960

cccatccgga acagggcctt ctacatggag gagggcgtgc cctattgcga gcgagactat 1020

gagaagatgt ttggcacgaa atgccatggc tgtgacttca agatcgacgc tggggaccgc 1080

ttcctggagg ccctgggctt cagctggcat gacacctgct tcgtctgtgc gatatgtcag 1140

atcaacctgg aaggaaagac cttctactcc aagaaggaca ggcctctctg caagagccat 1200

gccttctctc atgtgtgagc cccttctgcc cacagctgcc gcggtggccc ctagcctgag 1260

gggcctggag tcgtggccct gcatttctgg gtagggctgg caatggttgc cttaaccctg 1320

gctcctggcc cgagcctggg ctcccgggcc cctgcccacc caccttatcc tcccacccca 1380

ctccctccac caccacagca caccggtgct ggccacacca gccccctttc acctccagtg 1440

ccacaataaa cctgtaccca gctgaattcc aaaaaatcca aaaaaaaa 1488



<210> 9

<211> 1644 

<212> DNA 

<213> Homo sapiens 

<400> 9 

atggattcct tcaaggtagt gctggagggg ccagcacctt ggggcttccg gctgcaaggg 60
ggcaaggact tcaatgtgcc cctctccatt tcccggctca ctcctggggg caaagcggcg 120
caggccggag tggccgtggg tgactgggtg ctgagcatcg atggcgagaa tgcgggtagc 180
ctcacacaca tcgaagctca gaacaagatc cgggcctgcg gggagcgcct cagcctgggc 240
ctcagcaggg cccagccggt tcagagcaaa ccgcagaagg cctccgcccc cgccgcggac 300
cctccgcggt acacctttgc acccagcgtc tccctcaaca agacggcccg gccctttggg 360
gcgcccccgc ccgctgacag cgccccgcaa cagaatggac agccgctccg accgctggtc 420
ccagatgcca gcaagcagcg gctgatggag aacacagagg actggcggcc gcggccgggg 480
acaggccagt cgcgttcctt ccgcatcctt gcccacctca caggcaccga gttcatgcaa 540
gacccggatg aggagcacct gaagaaatca agccaggtgc ccaggacaga agccccagcc 600
ccagcctcat ctacacccca ggagccctgg cctggcccta ccgcccccag ccctaccagc 660
cgcccgccct gggctgtgga ccctgcgttt gccgagcgct atgccccgga caaaacgagc 720
acagtgctga cccggcacag ccagccggcc acgcccacgc cgctgcagag ccgcacctcc 780
attgtgcagg cagctgccgg aggggtgcca ggagggggca gcaacaacgg caagactccc 840
gtgtgtcacc agtgccacaa ggtcatccgg ggccgctacc tggtggcgtt gggccacgcg 900
taccacccgg aggagtttgt gtgtagccag tgtgggaagg tcctggaaga gggtggcttc 960
tttgaggaga agggcgccat cttctgccca ccatgctatg acgtgcgcta tgcacccagc 1020
tgtgccaagt gcaagaagaa gattacaggc gagatcatgc acgccctgaa gatgacctgg 1080
cacgtgcact gctttacctg tgctgcctgc aagacgccca tccggaacag ggccttctac 1140
atggaggagg gcgtgcccta ttgcgagcga gactatgaga agatgtttgg cacgaaatgc 1200
catggctgtg acttcaagat cgacgctggg gaccgcttcc tggaggccct gggcttcagc 1260
tggcatgaca cctgcttcgt ctgtgcgata tgtcagatca acctggaagg aaagaccttc 1320
tactccaaga aggacaggcc tctctgcaag agccatgcct tctctcatgt gtgagcccct 1380
tctgcccaca gctgccgcgg tggcccctag cctgaggggc ctggagtcgt ggccctgcat 1440
ttctgggtag ggctggcaat ggttgcctta accctggctc ctggcccgag cctgggctcc 1500
cgggcccctg cccacccacc ttatcctccc accccactcc ctccaccacc acagcacacc 1560
ggtgctggcc acaccagccc cctttcacct ccagtgccac aataaacctg tacccagctg 1620
aattccaaaa aatccaaaaa aaaa 1644



<210> 10

<211> 457 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

Leu Ser Arg Ala Gln Pro Val Gln Ser Lys Pro Gln Lys Ala Ser Ala
85 90 95

Pro Ala Ala Asp Pro Pro Arg Tyr Thr Phe Ala Pro Ser Val Ser Leu
100 105 110

Asn Lys Thr Ala Arg Pro Phe Gly Ala Pro Pro Pro Ala Asp Ser Ala
115 120 125

Pro Gln Gln Asn Gly Gln Pro Leu Arg Pro Leu Val Pro Asp Ala Ser
130 135 140

Lys Gln Arg Leu Met Glu Asn Thr Glu Asp Trp Arg Pro Arg Pro Gly
145 150 155 160

Thr Gly Gln Ser Arg Ser Phe Arg Ile Leu Ala His Leu Thr Gly Thr
165 170 175

Glu Phe Met Gln Asp Pro Asp Glu Glu His Leu Lys Lys Ser Ser Gln
180 185 190

Val Pro Arg Thr Glu Ala Pro Ala Pro Ala Ser Ser Thr Pro Gln Glu
195 200 205

Pro Trp Pro Gly Pro Thr Ala Pro Ser Pro Thr Ser Arg Pro Pro Trp
210 215 220

Ala Val Asp Pro Ala Phe Ala Glu Arg Tyr Ala Pro Asp Lys Thr Ser
225 230 235 240

Thr Val Leu Thr Arg His Ser Gln Pro Ala Thr Pro Thr Pro Leu Gln
245 250 255

Ser Arg Thr Ser Ile Val Gln Ala Ala Ala Gly Gly Val Pro Gly Gly
260 265 270

Gly Ser Asn Asn Gly Lys Thr Pro Val Cys His Gln Cys His Lys Val
275 280 285

Ile Arg Gly Arg Tyr Leu Val Ala Leu Gly His Ala Tyr His Pro Glu
290 295 300

Glu Phe Val Cys Ser Gln Cys Gly Lys Val Leu Glu Glu Gly Gly Phe
305 310 315 320

Phe Glu Glu Lys Gly Ala Ile Phe Cys Pro Pro Cys Tyr Asp Val Arg
325 330 335

Tyr Ala Pro Ser Cys Ala Lys Cys Lys Lys Lys Ile Thr Gly Glu Ile
340 345 350

Met His Ala Leu Lys Met Thr Trp His Val His Cys Phe Thr Cys Ala
355 360 365

Ala Cys Lys Thr Pro Ile Arg Asn Arg Ala Phe Tyr Met Glu Glu Gly
370 375 380

Val Pro Tyr Cys Glu Arg Asp Tyr Glu Lys Met Phe Gly Thr Lys Cys
385 390 395 400

His Gly Cys Asp Phe Lys Ile Asp Ala Gly Asp Arg Phe Leu Glu Ala
405 410 415

Leu Gly Phe Ser Trp His Asp Thr Cys Phe Val Cys Ala Ile Cys Gln
420 425 430

Ile Asn Leu Glu Gly Lys Thr Phe Tyr Ser Lys Lys Asp Arg Pro Leu
435 440 445

Cys Lys Ser His Ala Phe Ser His Val
450 455



<210> 11 
<211> 22 
<212> DNA 

<213> Rattus norvegicus 
<400> 11 

gccagggttt tcccagtcac ga 22 



<210> 12 
<211> 22 
<212> DNA 

<213> Rattus norvegicus 
<400> 12 

gccagggttt tcccagtcac ga 22



<210> 13
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 13 

tcttagcaga gcccagcctg ct 22 



<210> 14 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 14 

gcatgaactc tgtgcccgtg ag 22 



<210> 15 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 15 

atccttgctc acctcacggg 20 



<210> 16 
<211> 22 
<212> DNA 

<213> Rattus norvegicus 
<400> 16 

gcactgtgct ggttttgtct gg 22 



<210> 17 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 17 

catggattcc ttcaaggtag tgc 23 



<210> 18 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 18 

gttttgtctg gggcagagcg 20 



<210> 19 
<211> 44
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: adaptor for 
Marathon RACE reactions

<400> 19 

ctaatacgac tcactatagg gctcgagcgg ccgcccgggc aggt 44 



<210> 20 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
specific for Marathon RACE adaptor 

<400> 20 

ccatcctaat acgactcact atagggc 27



<210> 21 

<211> 765 

<212> DNA 

<213> Homo sapiens 



<400> 21 

ccgttgtttg taaaacgacg cagagcagcg ccctggccgg gccaagcagg agccggcatc 60
atggattcct tcaaggtagt gctggagggg ccagcacctt ggggcttccg gctgcaaggg 120
ggcaaggact tcaatgtgcc ctcctccatt tcccggctca cctctggggg caaggccgtg 180
caggccggag tggccgtaag tgactgggtg ctgagcatcg atggcgagaa tgcgggtagc 240
ctcacacaca tcgaagctca gaacaagatc cgggcctgcg gggagcgcct cagcctgggc 300
ctcaacaggg cccagccggt tcagaacaaa ccgcaaaagg cctccgcccc cgccgcggac 360
cctccgcggt acacctttgc accaagcgtc tccctcaaca agacggcccg gcccttgggg 420
gcgcccccgc ccgctgacag cgccccgcag cagaatggac agccgctccg accgctggtc 480
ccagatgcca gcaagcagcg gctgatggag aacacagagg actggcggcc gcggccgggg 540
acaggccagt gccgttcctt tcgcatcctt gctcacctta caggcaccga gttcatgcaa 600
gacccggatg aggagcacct gaagaaatca agccaggtgc ccaggacaga agccccagcc 660
ccagcctcat ctacacccca ggagccctgg cctggcccta ccgcccccag ccctaccagc 720
cgcccgccct gggctgtgga ccctgcgttt gccgagcgct atgcc 765



<210> 22 

<211> 1689 

<212> DNA 

<213> Homo sapiens 

<400> 22 

cgacgcagag cagcgccctg gccgggccaa gcaggagccg gcatcatgga ttccttcaag 60
gtagtgctgg aggggccagc accttggggc ttccggctgc aagggggcaa ggacttcaat 120
gtgcccctct ccatttcccg gctcactcct gggggcaaag cggcgcaggc cggagtggcc 180
gtgggtgact gggtgctgag catcgatggc gagaatgcgg gtagcctcac acacatcgaa 240
gctcagaaca agatccgggc ctgcggggag cgcctcagcc tgggcctcag cagggcccag 300
ccggttcaga gcaaaccgca gaaggcctcc gcccccgccg cggaccctcc gcggtacacc 360
tttgcaccca gcgtctccct caacaagacg gcccggccct ttggggcgcc cccgcccgct 420
gacagcgccc cgcaacagaa tggacagccg ctccgaccgc tggtcccaga tgccagcaag 480
cagcggctga tggagaacac agaggactgg cggccgcggc cggggacagg ccagtcgcgt 540
tccttccgca tccttgccca cctcacaggc accgagttca tgcaagaccc ggatgaggag 600
cacctgaaga aatcaagcca ggtgcccagg acagaagccc cagccccagc ctcatctaca 660
ccccaggagc cctggcctgg ccctaccgcc cccagcccta ccagccgccc gccctgggct 720
gtggaccctg cgtttgccga gcgctatgcc ccggacaaaa cgagcacagt gctgacccgg 780
cacagccagc cggccacgcc cacgccgctg cagagccgca cctccattgt gcaggcagct 840
gccggagggg tgccaggagg gggcagcaac aacggcaaga ctcccgtgtg tcaccagtgc 900
cacaaggtca tccggggccg ctacctggtg gcgttgggcc acgcgtacca cccggaggag 960
tttgtgtgta gccagtgtgg gaaggtcctg gaagagggtg gcttctttga ggagaagggc 1020
gccatcttct gcccaccatg ctatgacgtg cgctatgcac ccagctgtgc caagtgcaag 1080
aagaagatta caggcgagat catgcacgcc ctgaagatga cctggcacgt gcactgcttt 1140
acctgtgctg cctgcaagac gcccatccgg aacagggcct tctacatgga ggagggcgtg 1200
ccctattgcg agcgagacta tgagaagatg tttggcacga aatgccatgg ctgtgacttc 1260
aagatcgacg ctggggaccg cttcctggag gccctgggct tcagctggca tgacacctgc 1320
ttcgtctgtg cgatatgtca gatcaacctg gaaggaaaga ccttctactc caagaaggac 1380
aggcctctct gcaagagcca tgccttctct catgtgtgag ccccttctgc ccacagctgc 1440
cgcggtggcc cctagcctga ggggcctgga gtcgtggccc tgcatttctg ggtagggctg 1500
gcaatggttg ccttaaccct ggctcctggc ccgagcctgg gctcccgggc ccctgcccac 1560
ccaccttatc ctcccacccc actccctcca ccaccacagc acaccggtgc tggccacacc 1620
agcccccttt cacctccagt gccacaataa acctgtaccc agctgaattc caaaaaatcc 1680
aaaaaaaaa 1689



<210> 23 
<211> 22 
<212> DNA 

<213> Homo sapiens 
<400> 23 

gcactgtgct cgttttgtcc gg 22 



<210> 24 
<211> 21 
<212> DNA 

<213> Homo sapiens 
<400> 24 

tccttgctca cctcacgggc a 21 



<210> 25 
<211> 30 
<212> DNA 

<213> Homo sapiens 
<400> 25 

tcctcatccg ggtcttgcat gaactcggtg 30



<210> 26 
<211> 28 
<212> DNA

<213> Homo sapiens
<400> 26 

gcccccgccc gctgacagcg ccccgcaa 28 



<210> 27 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 27 

tccttgctca cctcacgggc accg 24 



<210> 28 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 28 

gtaatacgac tcactatagg gc 22 



<210> 29 
<211> 23 
<212> DNA 

<213> Rattus norvegicus 
<400> 29 

gcggctgatg gagaatactg aag 23 



<210> 30 
<211> 23 
<212> DNA 

<213> Rattus norvegicus 
<400> 30 

atcttgtggc actggtggca tac 23 



<210> 31 
<211> 22 
<212> DNA 

<213> Rattus norvegicus 
<400> 31 

tgtgtcgggt cagcactgtg ct 22 



<210> 32 

<211> 1620 

<212> DNA 

<213> Homo sapiens

<400> 32

atggattcct tcaaggtagt gctggagggg ccagcacctt ggggcttccg gctgcaaggg 60 
ggcaaggact tcaatgtgcc cctctccatt tcccggctca ctcctggggg caaagcggcg 120 
caggccggag tggccgtggg tgactgggtg ctgagcatcg atggcgagaa tgcgggtagc 180 
ctcacacaca tcgaagctca gaacaagatc cgggcctgcg gggagcgcct cagcctgggc 240
ctcagcaggg cccagccggt tcagagcaaa ccgcagaagg cctccgcccc cgccgcggac 300 
cctccgcggt acacctttgc acccagcgtc tccctcaaca agacggcccg gccctttggg 360 
gcgcccccgc ccgctgacag cgccccgcaa cagaatggac agccgctccg accgctggtc 420 
ccagatgcca gcaagcagcg gctgatggag aacacagagg actggcggcc gcggccgggg 480 
acaggccagt cgcgttcctt ccgcatcctt gcccacctca caggcaccga gttcatgcaa 540
gacccggatg aggagcacct gaagaaatca agccaggtgc ccaggacaga agccccagcc 600 
ccagcctcat ctacacccca ggagccctgg cctggcccta ccgcccccag ccctaccagc 660 
cgcccgccct gagctgtgga ccctgcgttt gccgagcgct atgccccgga caaaacgagc 720
acagtgctga cccggcacag ccagccggcc acgcccacgc cgctgcagag ccgcacctcc 780
attgtgcagg cagctgccgg aggggtgcca ggagggggca gcaacaacgg caagactccc 840
gtgtgtcacc agtgccacaa ggtcatccgg ggccgctacc tggtggcgtt gggccacgcg 900 
taccacccgg aggagtttgt gtgtagccag tgtgggaagg tcctggaaga gggtggcttc 960 
tttgaggaga agggcgccat cttctgccca ccatgctatg acgtgcgcta tgcacccagc 1020 
tgtgccaagt gcaagaagaa gattacaggc gagatcatgc acgccctgaa gatgacctgg 1080 
cacgtgcact gctttacctg tgctgcctgc aagacgccca tccggaacag ggccttctac 1140
atggaggagg gcgtgcccta ttgcgagcga gactatgaga agatgtttgg cacgaaatgc 1200
catggctgtg acttcaagat cgacgctggg gaccgcttcc tggaggccct gggcttcagc 1260
tggcatgaca cctgcttcgt ctgtgcgata tgtcagatca acctggaagg aaagaccttc 1320
tactccaaga aggacaggcc tctctgcaag agccatgcct tctctcatgt gtgagcccct 1380 
tctgcccaca gctgccgcgg tggcccctag cctgaggggc ctggagtcgt ggccctgcat 1440 
ttctgggtag ggctggcaat ggttgcctta accctggctc ctggcccgag cctgggctcc 1500 
cgggcccctg cccacccacc ttatcctccc accccactcc ctccaccacc acagcacacc 1560 
ggtgctggcc acaccagccc cctttcacct ccagtgccac aataaacctg tacccagctg 1620 



<210> 33 

<211> 1665 

<212> DNA 

<213> Homo sapiens 

<400> 33 

cgacgcagag cagcgccctg gccgggccaa gcaggagccg gcatcatgga tcccttcaag 60 
gtagtgctgg aggggccagc accttggggc ttccggctgc aagggggcaa ggacttcaat 120 

gtgcccctct ccatttcccg gctcactcct gggggcaaag cggcgcaggc cggagtggcc 180 

gtgggtgact gggtgctgag catcgatggc gagaatgcgg gtagcctcac acacatcgaa 240 

gctcagaaca agatccgggc ctgcggggag cgcctcagcc tgggcctcag cagggcccag 300 

ccggttcaga gcaaaccgca gaaggcctcc gcccccgccg cggaccctcc gcggtacacc 360 

tttgcaccca gcgtctccct caacaagacg gcccggccct ttggggcgcc cccgcccgct 420 

gacagcgccc cgcaacagaa tggacagccg ctccgaccgc tggtcccaga tgccagcaag 480 

cagcggctga tggagaacac agaggactgg cggccgcggc cggggacagg ccagtcgcgt 540 

tccttccgca tccttgccca cctcacaggc accgagttca tgcaagaccc ggatgaggag 600 

cacctgaaga aatcaagcca ggtgcccagg acagaagccc cagccccagc ctcatctaca 660 

ccccaggagc cctggcctgg ccctaccgcc cccagcccta ccagccgccc gccctgagct 720 

gtggaccctg cgtttgccga gcgctatgcc ccggacaaaa cgagcacagt gctgacccgg 780 

cacagccagc cggccacgcc cacgccgctg cagagccgca cctccattgt gcaggcagct 840 

gccggagggg tgccaggagg gggcagcaac aacggcaaga ctcccgtgtg tcaccagtgc 900 

cacaaggtca tccggggccg ctacctggtg gcgttgggcc acgcgtacca cccggaggag 960 

tttgtgtgta gccagtgtgg gaaggtcctg gaagagggtg gcttctttga ggagaagggc 1020 

gccatcttct gcccaccatg ctatgacgtg cgctatgcac ccagctgtgc caagtgcaag 1080 

aagaagatta caggcgagat catgcacgcc ctgaagatga cctggcacgt gcactgcttt 1140 

acctgtgctg cctgcaagac gcccatccgg aacagggcct tctacatgga ggagggcgtg 1200

ccctattgcg agcgagacta tgagaagatg tttggcacga aatgccatgg ctgtgacttc 1260

aagatcgacg ctggggaccg cttcctggag gccctgggct tcagctggca tgacacctgc 1320 

ttcgtctgtg cgatatgtca gatcaacctg gaaggaaaga ccttctactc caagaaggac 1380 

aggcctctct gcaagagcca tgccttctct catgtgtgag ccccttctgc ccacagctgc 1440 

cgcggtggcc cctagcctga ggggcctgga gtcgtggccc tgcatttctg ggtagggctg 1500 

gcaatggttg ccttaaccct ggctcctggc ccgagcctgg gctcccgggc ccctgcccac 1560 

ccaccttatc ctcccacccc actccctcca ccaccacagc acaccggtgc tggccacacc 1620 

agcccccttt cacctccagt gccacaataa acctgtaccc agctg 1665 



<210> 34 
<211> 223 
<212> PRT 

<213> Homo sapiens 
<400> 34 

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

Leu Ser Arg Ala Gln Pro Val Gln Ser Lys Pro Gln Lys Ala Ser Ala
85 90 95

Pro Ala Ala Asp Pro Pro Arg Tyr Thr Phe Ala Pro Ser Val Ser Leu
100 105 110

Asn Lys Thr Ala Arg Pro Phe Gly Ala Pro Pro Pro Ala Asp Ser Ala
115 120 125

Pro Gln Gln Asn Gly Gln Pro Leu Arg Pro Leu Val Pro Asp Ala Ser
130 135 140

Lys Gln Arg Leu Met Glu Asn Thr Glu Asp Trp Arg Pro Arg Pro Gly
145 150 155 160

Thr Gly Gln Ser Arg Ser Phe Arg Ile Leu Ala His Leu Thr Gly Thr
165 170 175

Glu Phe Met Gln Asp Pro Asp Glu Glu His Leu Lys Lys Ser Ser Gln
180 185 190

Val Pro Arg Thr Glu Ala Pro Ala Pro Ala Ser Ser Thr Pro Gln Glu
195 200 205

Pro Trp Pro Gly Pro Thr Ala Pro Ser Pro Thr Ser Arg Pro Pro
210 215 220



<210> 35 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 35 

gagccggcat catggattcc 20 



<210> 36 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 36 

gctgcctgca caatggaggt 20 



<210> 37 

<211> 1456

<212> DNA 

<213> Homo sapiens 
<400> 37 

cgacgcagag cagcgccctg gccgggccaa gcaggagccg gcatcatgga ttccttcaag 6 0 
gtagtgctgg aggggccagc accttggggc ttccggctgc aagggggcaa ggacttcaat 120 
gtgcccctct ccatttcccg gctcactcct gggggcaaag cggcgcaggc cggagtggcc 18 0 
gtgggtgact gggtgctgag catcgatggc gagaatgcgg gtagcctcac acacatcgaa 240 
gctcagaaca agatccgggc ctgcggggag cgcctcagcc tgggcctcag cagggcccag 3 00 
ccggttcaga gcaaaccgca gaaggtgcag acccctgaca aacagccgct ccgaccgctg 360 
gtcccagatg ccagcaagca gcggctgatg gagaacacag aggactggcg gccgcggccg 420 
gggacaggcc agtcgcgttc cttccgcatc cttgcccacc tcacaggcac cgagttcatg 480 
caagacccgg atgaggagca cctgaagaaa tcaagccagg tgcccaggac agaagcccca 540 
gccccagcct catctacacc ccaggagccc tggcctggcc ctaccgcccc cagccctacc 600 
agccgcccgc cctgggctgt ggaccctgcg tttgccgagc gctatgcccc ggacaaaacg 660 
agcacagtgc tgacccggca cagccagccg gccacgccca cgccgctgca gagccgcacc 720 
tccattgtgc aggcagctgc cggaggggtg ccaggagggg gcagcaacaa cggcaagact 780 
cccgtgtgtc accagtgcca caaggtcatc cggggccgct acctggtggc gttgggccac 840 
gcgtaccacc cggaggagtt tgtgtgtagc cagtgtggga aggtcctgga agagggtggc 900 
ttctttgagg agaagggcgc catcttctgc ccaccatgct atgacgtgcg ctatgcaccc 960 
agctgtgcca agtgcaagaa gaagattaca ggcgagatca tgcacgccct gaagatgacc 1020 
tggcacgtgc actgctttac ctgtgctgcc tgcaagacgc ccatccggaa cagggccttc 1080 
tacatggagg agggcgtgcc ctattgcgag cgagactatg agaagatgtt tggcacgaaa 1140 
tgccatggct gtgacttcaa gatcgacgct ggggaccgct tcctggaggc cctgggcttc 1200 
agctggcatg acacctgctt cgtctgtgcg atatgtcaga tcaacctgga aggaaagacc 1260 
ttctactcca agaaggacag gcctctctgc aagagccatg ccttctctca tgtgtgagcc 1320 
ccttctgccc acagctgccg cggtggcccc tagcctgagg ggcctggagt cgtggccctg 1380 
catttctggg tagggctggc aatggttgcc ttaaccctgg ctcctggccc gagcctgggc 1440 
tcccgggccc tgccca 1456 



<210> 38 
<211> 423 
<212> PRT 

<213> Homo sapiens 
<400> 38 

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe 
1 5 10 15 

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg 
20 25 30 

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp 
35 40 45 

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile 
50 55 60 

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly 
65 70 75 80 

Leu Ser Arg Ala Gln Pro Val Gln Asn Lys Pro Gln Lys Val Gln Thr 
85 90 95 

Pro Asp Lys Gln Pro Leu Arg Pro Leu Val Pro Asp Ala Ser Lys Gln 
100 105 110 

Arg Leu Met Glu Asn Thr Glu Asp Trp Arg Pro Arg Pro Gly Thr Gly 
115 120 125 

Gln Ser Arg Ser Phe Arg Ile Leu Ala His Leu Thr Gly Thr Glu Phe 
130 135 140 

Met Gln Asp Pro Asp Glu Glu His Leu Lys Lys Ser Ser Gln Val Pro 
145 150 155 160 

Arg Thr Glu Ala Pro Ala Pro Ala Ser Ser Thr Pro Gln Glu Pro Trp 
165 170 175 

Pro Gly Pro Thr Ala Pro Ser Pro Thr Ser Arg Pro Pro Trp Ala Val 
180 185 190 

Asp Pro Ala Phe Ala Glu Arg Tyr Ala Pro Asp Lys Thr Ser Thr Val 
195 200 205 

Leu Thr Arg His Ser Gln Pro Ala Thr Pro Thr Pro Leu Gln Ser Arg 
210 215 220 

Thr Ser Ile Val Gln Ala Ala Ala Gly Gly Val Pro Gly Gly Gly Ser 
225 230 235 240 

Asn Asn Gly Lys Thr Pro Val Cys His Gln Cys His Gln Val Ile Arg 
245 250 255 

Ala Arg Tyr Leu Val Ala Leu Gly His Ala Tyr His Pro Glu Glu Phe 
260 265 270 

Val Cys Ser Gln Cys Gly Lys Val Leu Glu Glu Gly Gly Phe Phe Glu 
275 280 285 

Glu Lys Gly Ala Ile Phe Cys Pro Pro Cys Tyr Asp Val Arg Tyr Ala 
290 295 300 

Pro Ser Cys Ala Lys Cys Lys Lys Lys Ile Thr Gly Glu Ile Met His 
305 310 315 320 

Ala Leu Lys Met Thr Trp His Val Leu Cys Phe Thr Cys Ala Ala Cys 
325 330 335 

Lys Thr Pro Ile Arg Asn Arg Ala Phe Tyr Met Glu Glu Gly Val Pro 
340 345 350 

Tyr Cys Glu Arg Asp Tyr Glu Lys Met Phe Gly Thr Lys Cys Gln Trp 
355 360 365 

Cys Asp Phe Lys Ile Asp Ala Gly Asp Arg Phe Leu Glu Ala Leu Gly 
370 375 380 

Phe Ser Trp His Asp Thr Cys Phe Val Cys Ala Ile Cys Gln Ile Asn 
385 390 395 400 

Leu Glu Gly Lys Thr Phe Tyr Ser Lys Lys Asp Arg Pro Leu Cys Lys 
405 410 415 

Ser His Ala Phe Ser His Val 
420 



<210> 39 

<211> 1575 

<212> DNA 

<213> Homo sapiens 

<400> 39 

cgacgcagag cagcgccctg gccgggccaa gcaggagccg gcatcatgga ttccttcaag 60 

gtagtgctgg aggggccagc accttggggc ttccggctgc aagggggcaa ggacttcaat 120 

gtgcccctct ccatttcccg gctcactcct gggggcaaag cggcgcaggc cggagtggcc 180 

gtgggtgact gggtgctgag catcgatggc gagaatgcgg gtagcctcac acacatcgaa 240 

gctcagaaca agatccgggc ctgcggggag cgcctcagcc tgggcctcag cagggcccag 300 

ccggttcaga gcaaaccgca gaaggcctcc gcccccgccg cggaccctcc gcggtacacc 360 

tttgcaccca gcgtctccct caacaagacg gcccggccct ttggggcgcc cccgcccgct 420 

gacagcgccc cgcaacagaa tgggtgcaga cccctgacaa acagccgctc cgaccgctgg 480 

tcccagatgc cagcaagcag cggctgatgg agaacacaga ggactggcgg ccgcggccgg 540 

ggacaggcca gtcgcgttcc ttccgcatcc ttgcccacct cacaggcacc gagttcatgc 600 

aagacccgga tgaggagcac ctgaagaaat caagccaggt gcccaggaca gaagccccag 660 

ccccagcctc atctacaccc caggagccct ggcctggccc taccgccccc agccctacca 720 

gccgcccgcc ctgggctgtg gaccctgcgt ttgccgagcg ctatgccccg gacaaaacga 780 

gcacagtgct gacccggcac agccagccgg ccacgcccac gccgctgcag agccgcacct 840 

ccattgtgca ggcagctgcc ggaggggtgc caggaggggg cagcaacaac ggcaagactc 900 

ccgtgtgtca ccagtgccac aaggtcatcc ggggccgcta cctggtggcg ttgggccacg 960 

cgtaccaccc ggaggagttt gtgtgtagcc agtgtgggaa ggtcctggaa gagggtggct 1020 

tctttgagga gaagggcgcc atcttctgcc caccatgcta tgacgtgcgc tatgcaccca 1080 

gctgtgccaa gtgcaagaag aagattacag gcgagatcat gcacgccctg aagatgacct 1140 

ggcacgtgca ctgctttacc tgtgctgcct gcaagacgcc catccggaac agggccttct 1200 
acatggagga gggcgtgccc tattgcgagc gagactatga gaagatgttt ggcacgaaat 1260 

gccatggctg tgacttcaag atcgacgctg gggaccgctt cctggaggcc ctgggcttca 1320 

gctggcatga cacctgcttc gtctgtgcga tatgtcagat caacctggaa ggaaagacct 1380 

tctactccaa gaaggacagg cctctctgca agagccatgc cttctctcat gtgtgagccc 1440 

cttctgccca cagctgccgc ggtggcccct agcctgaggg gcctggagtc gtggccctgc 1500 

atttctgggt agggctggca atggttgcct taaccctggc tcctggcccg agcctgggct 1560 

cccgggccct gccca 1575 



<210> 40 

<211> 153 

<212> PRT 

<213> Homo sapiens 

<400> 40 

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe 
1 5 10 15 

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg 
20 25 30 

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp 
35 40 45 

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile 
50 55 60 

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly 
65 70 75 80 

Leu Ser Arg Ala Gln Pro Val Gln Ser Lys Pro Gln Lys Ala Ser Ala 
85 90 95 

Pro Ala Ala Asp Pro Pro Arg Tyr Thr Phe Ala Pro Ser Val Ser Leu 
100 105 110 

Asn Lys Thr Ala Arg Pro Phe Gly Ala Pro Pro Pro Ala Asp Ser Ala 
115 120 125 

Pro Gln Gln Asn Gly Cys Arg Pro Leu Thr Asn Ser Arg Ser Asp Arg 
130 135 140 

Trp Ser Gln Met Pro Ala Ser Ser Gly 
145 150 



<210> 41 

<211> 24740 

<212> DNA 

<213> Homo sapiens 

<400> 41 

nnnnnntgta ttttatcata ttttaaaaat caaaaaacaa aaggcagttg aggttaggca 60 
tggaggttcg tgcctgtaat cccagcactt tgggaagccg aagcacgtgg atcacctgag 120 
gtcaggagtt cgagaccagc ctgcccaata tggtaaaacc ctgtctctac taaaaataca 180 
aaaaattagc caggcatggt ggtgggcacc tgtaatccca gctacttggg agactgaggc 240 
aggagaatca cttaaacccg ggaggcgggc tgggcgcggt ggctcatgcc tgtaatccca 300 
gcactttggg aggccgagac aggcggatca tgaggtcagg agatcgagat catcctggct 360 
aacatggtga aaccccatct ctactaaaaa tacaaaaaaa attagccagg cctggtggcg 420 
ggcacctgta gtcccagcta cttgggaggc tgaggcagga gaatggcgtg aacctgggag 480 
gcggcgttgc agtgagccaa gatcgcgcca ctgcactcca gcctgggcga caagagtgag 540 
actccatctt aaagaaaaaa aacaaacccg ggaggcggaa attgcagtca gccgagatct 600 
cgccattgca ctcaagtatg ggtgacagag caagactcca tgtcaaaaaa aaaggcagtt 660 
gacaggagca aggagcctgg tgaggaagct gtggcatttg acccggctgt gttgctatgg 720 
gccagggtgg tgctagtaga ggagctgagt gggaaagagc acaggggaca tgctgaaggc 780 
ctgggtgtgg ggatgaggca gagattgggg gcaccttgca gggtcatagc aggtggctgt 840 
ggtgagatgg aggaagacac ctggggtact gctctaggct gtcagacata cagaagctgg 900 
cccagccaag cccaggggct gcaagggaca tccttttgtg tccccagtga tctgcagctc 960 
tcagacaccc tcaagcacag tgcctcttgc ccagcccagc actctcagtg gggagccagg 1020 
tgggagaaca ggctcggaag gggacctagg cttatgcagc gagccgggca aagctggaac 1080 
tggagcccag gcccctggat gccccctggc ttgtggagtt ctgggatact gaggggaggg 1140 
gacagggcat gggagtgcgg tgctctcacc tttgacttga actcattccc caggggacag 1200 
gggaggcctc ctcaggatcc acagatgccc agtctcccaa gaggggcctg gtccccatgg 1260 
aggaaaactc catctactcc tcctggcagg aaggtaagtt ggaggacgtg caagggcagc 1320 
ctcagccccc cacacccagg gctgggtctt tttgggactg acggagctgt cctggccacc 1380 
tgccacagtg ggcgagtttc ccgtggtggt gcagaggact gaggccgcca cccgctgcca 1440 
gctgaagggg ccggccctgc tggtgctggg cccagacgcc atccagctga gggaggccaa 1500 
ggcacccagg ccctctacag ctggccctac cacttcctgc gcaagttcgg ctccgacaag 1560 
gtgaggtgca ggggfegggaa agggtgaggg gctgacagcc tggaccctcc tgctaatccc 1620 
cacccgtgtg ccctgtgccc agggcgtgtt ctcctttgag gccggccgtc gctgccactc 1680 
gggtgagggc ctctttgcct tcagcacccc ctgtgcccct gacctgtgca gggctgtggc 1740 
cggggccatc gccgccagcg ggagcggctg ccagagctga ccaggcccca gccctgcccc 1800 
ctgccacggg ccacctctct gccctccctg gacacccccg gagagcttcg ggagatgcca 1860 
ccaggacctg agccacccac gtccaggaaa atgcacctgg ccgagcccgg accccagagc 1920 
ctgccgctac tgctaggccc ggagcccaac gatctggcgt ccgggctcta cgcttcagtg 1980 
tgcaagcgtg ccagtgggcc cccaggcaat gagcacctct atgagaacct gtgtgtgctg 2040 
gaggccagcc ccacgctgca cggtggggaa cctgagccgc acgagggccc cggcagccgc 2100 
agccccacaa ccagtcccat ctaccacaac ggccaggact tgagctggcc cggcccggcc 2160 
aacgacagta ccctggaggc ccagtaccgg cggctgctgg agctggatca ggtggagggc 2220 
acaggccgcc ctgaccctca ggcaggtttc aaggccaagc tggtgaccct gctgagtcgt 2280 
gagcggagga agggcccagc cccttgtgac cggccctgaa cgcccagcag agtggtggcc 2340 
agaggggaga ggtgctcccc ctgggacagg agggtgggct ggtgggcaaa cattgggccc 2400 
atgcagacac acgcctgtgt ccaccctggc ctgcaggaac aaggcaggcc gcctgtggag 2460 
gacctcagcc ctgccctgcc ctcctcatga atagtgtgca gactcacaga taataaagct 2520 
cagagcagct cccggcaggg gcactcacgg cacacgcccc tgcccacgtt cattgcggcc 2580 
aacacaagca ccctgtgccg gttccagggg cacaggtgac ctgggcctta cctgccaccc 2640 
gtgggctcaa acccactgca gcagacagac gggatggaaa tcattaggac tccatgttgc 2700 
tctgcacggc cgagtgacac gaagaggcag gcggagggag ctgtgaggct tacttgtcag 2760 
actcaggaag gagcaacatg agggcccaac tggagacccg gaggcccgag ctgggaggag 2820 
gcagtggggg cggggtgcag gtggaaggga tttcagagac accctcgtcc aaaacacttg 2880 
ttccctgctg aaactccaac aatttgcaga tacttctggg aaccccaggc gtcagtctcc 2940 
tcatctgtaa aggagagaga accgatgacg tatcaggcat aatccttgat gagagtttgc 3000 
tgcgtgccta ctcagtgcca ggcgctgggg gacacagccg tgttcaggac agccttggtc 3060 
ctgttctccg ggagccgaca ttccaggggg agagaagttt cctgaagact tccatgctgc 3120 
gttccctcct ctgctcctgc tcctggcgcc atcctaggag ccagccatgc acgcaagcgt 3180 
catgcctcca gggctctgac tgcccagccc ctcaccgcaa ctccacctca gctgcacaca 3240 
cccttggcac atcctgaacc tcattttcat gacggacaca caatttttgc tctctcctgt 3300 
ccaagcctca tcctctggcc gccacctcct tccagctcac ttcctttagt gcggccagta 3360 
ccgcccctgc ctaggcatgt cgacctgcag ggaccctttt ctggctcttc gaggcctctg 3420 
cccaccatcc cctctttgtt ctccatagtc ccttccccct gttctctctc gtttcatctt 3480 
actggtctgg caaagtcccc ggccttgggc gagccagacc tcctcagtgc ctgcacacag 3540 
ctgcccacag ccagagaaat ccatttaagc agactgcctg catccttctt aacagtgcaa 3600 
ggcaggcact ccctgccaca agagaccctg ttccctagta gggcagcttt tctcctcccc 3660 
agaacctcct gtctatcccc acccaatgtc tcctcacagg catattgggg aaacaggtca 3720 
ggctctccca ccgtatctgc aagtgtactg gcatccatct gtcttcttcc tacccctaca 3780 
gtagaaacag tgtctgtccc cagctgtgct ctgatcccgg ctcctttcac ctcagagctt 3840 
ggaaaattga gctgtcccca ctctctcctg cgcccattca tcctaccagc agcttttcca 3900 
gccacacgca aacatgctct gtaatttcac attttaaacc ttcccttgac ctcacattcc 3960 
tcttcggcca cctctgtttc tctgttcctc ttcacagcaa aaactgttca aaagagttgt 4020 
tgattacttt catttccact ttctcacccc cattctctcc tcaattaact ctccttcatc 4080 
cccatgatgc cattatgtgg cttttattag agtcaccaac cttattctcc aaaacaaaag 4140 
caacaaggac tttgacttct cagcagcact cagctctggt tcttgaaaca cccccgttac 4200 
ttgctattcc tcctacctca taacaatctc cttcccagcc tctactgctg ccttctctga 4260 
gttcttccca gggtcctagg ctcagatgta gtgtagctca accctgctac acaaagaatc 4320 
tcctgaaagc ctgtaaaaat gtccatgcat gttctgtgag tgatctacca agaaaataaa 4380 
aaattttaaa aatcaaatgc ccatgcctgg gcccacacgc aggggctctg atttcatcag 4440 
tctggtaggt gggttctggg catccacgct cactggattt ccggatgatt gtagtatgca 4500 
gcctaggctg ggaaccactg gcctcagcaa gccagtcatt ctccaggtgt cacagaccct 4560 
ctaggtgcta atgaccccga aggtctgtct tcagtgcaca cctccccctg agctccagat 4620 
ttaggaatcc cactgcacac gagacatctg gatgtggaaa agacatctcc agatcccatg 4680 
ggtgaaaggg ggttggggga atggagactc gtgttcttcc aggatgtgtg tggacacaga 4740 
atgcaaagcc tggagggatg ctagagccat agggaggaag atttcggctc acttattcat 4800 
gcaagcactt cctgatgggt aaggtcttag agcaagctga ggccaagagg cgggcagtcg 4860 
aggtgctgct gcaggcaccc ccactcccta cagtggcaag cccaagccca gcccttggca 4920 
gctcaaatcc caggacacgc tgaaggtcac ccagagagtc aggggcatgg ctagaaccag 4980 
aacccaggac tctggggacc cagcatggca tcctttcctt cattacaaat ctgagctgct 5040 
ttgtttccta gggatttctg tgatattcca aggggactgt gggaaagaaa gtccttggaa 5100 
accaccagga cgctagaggc ctggcctgga gcctcaggag tctcggccac cagagggcgc 5160 
tgggtccttg tccaggtcca gttgctacgc aggggctgcc tgtgctggga ggctccccag 5220 
gggacacaga ccagagcctt gcaccagccc aaggaatggg agcctggggt cctctctgct 5280 
ggaggactgc caggaccccc aggctgccgc ctcttccttt gctcatttgc tgtttcactt 5340 
tgtcaatcct tcctttcttc gtgtgttcat tcacatccac tgtgtgctgg ccctggggaa 5400 
atgttagata agacacatta gctgtgtgtc ttcattgtcc taacaaagaa cacaccctgg 5460 
aaagagcacc gcagagagtc cccattcccc catctccctc cacacatgga atctggagat 5520 
gccttttcca catccagatg tctctggtgc tgtgggattc ttaaataaac aaacatttca 5580 
tacagaatgt gagatgatgg agatgctatg gggaaaagta aagcagaggg agggcctagt 5640 
gtgtgatgcg ggtgaggcat ccagggattg ctgtttcagc tgtgatcagg aaaggccctg 5700 
ggaggaggcc acatctgagc agagacctaa ataaagttgg aaacctgttg ctgagatatc 5760 
tggagaagtg tttcaagggc cgggcaccgg gcatggtggc tcacgcctgt aatcccagca 5820 
ctttgggagg ccaaggcagg tggatcgctg gaggtcagga gtttgagagc agcctgacca 5880 
acatggagaa accccatctc tactaaacat ataaaaatta tccgggcatg gtggttcatg 5940 
cctgtagtcc cagctactcg ggaggttgag gcaggagaat cacttgaacg tgggaggcag 6000 
aggttgcagc aagccgagat cacaccactg cactccagcc tggatgacag agcgagactc 6060 
cgtctcaaaa aaaaaaaaga aaagaaaaaa gaaaaaaaaa gaaaagtgtt tcaagcaggg 6120 
gaactggcaa gtggagaggc cctgaggcag aaatatgctt ggcctgctgg aggaaatgtg 6180 
agtgaggagg tcagggtggc tggagtggag ggagcgagtg gtaggagtca gacccagttt 6240 
attcatattc tgtaggtctt aaggacttca gttttatttt gagtgcaata tgagcccact 6300 
ggaatgctaa aagctgagag tgacatggtg ctgtgattct ggctttaaaa atatcacttt 6360 
ggctgcttcg tgaagactct ggaaggggca agggtgaaag cagggatgcc cgttaggaga 6420 
ccgttacagg ggcgcaggca caaaatggca gtggctggga caatggtggc agcagcggtt 6480 
agatgtgaac atgttgaagg tggaatttgc agaatctggg ggaggacaga agagaaagga 6540 
taacttcatc gtttctgctg aaccagttgg ataaatgttg gtggcacttc ttgaagtgag 6600 
gaaggagtta ggaaggtggg aaaggcacaa gtttgaattg ggccatgatg gtctgagata 6660 
cctagtacag tggttcccca acctttttgg cagaagggac cgctttcatg gaagacaatt 6720 
tttccacaga ctgggggtgg ggtggggatg gtttcagggt ggttcgagtg cagtacattt 6780 
atcattagac tctttttttt tttttttttt tgagatagag tctcgctctg tcacccacac 6840 
tggagtgcag tggagccatc ttggctcact acaacctctg ctgcccaggt tcaagtcatt 6900 
ctcctgcctc agcctctcaa gtagctggga ttataggcat atgcgccacc acgcccagct 6960 
aatttttgta tttttagtag agacggggtt tcaccatatt ggccaggatg gtctcgaact 7020 
cctgacctca agtgatcctc ccccgcctca acctcccaaa gtgctggggt tacaggcgtg 7080 
aaccactgca cccggcccat ttatcattag attctcataa ggaatgagca acctagatcc 7140 
ctcgcatgca cagttcacaa tagggttcac gctcctatgg gagtctaatg ctgccgctgc 7200 
actcagcttc tctggcttgc cgctgctcac cttctgctgt gcagcccagt tcctaacagg 7260 
ccacaaacgg ggagttgggg acccctgatc tagtaaacat ctaggcaggg ttttggataa 7320 
tggagttaga gttcctgggg agaggtcagg ctggccatga aacatgggat gcctttgcat 7380 
ataggtggtg ttgaaagcca caggacagta cggggtctca gggggtgagc ataaagagag 7440 
gcgacatcag atggccaagg ccagaggcag aggaggatgg gaaggagggg ccagtggggc 7500 
agggggaagc tgtgaagcca gggaaaaagg gtgtttcgcg gaaaaggatc aacctggacc 7560 
agtgctgccc ctaggcaggg caggatgaaa cttaaccacc acggattcca tggccccatg 7620 
gcctccaggc cacaggggac cttgagaaga gagatctcag gggacgggtg cggacaagag 7680 
cccgcctggc atggcttcaa gagataactg aaggaaagca agtggagacg cgataaacag 7740 
acaactccct ggaggaattt tactctcgag aggagaatta aagggtagta gctggagagg 7800 
gatgtggggt caagagaagg tctttaacga cgagaactct cacggcggtt tgtgcagaac 7860 
agggtgggtg tgatgactgt ggatggagag gggagaactg cagcgactct gtcctaggag 7920 
gaggtgatgg gccgggacca ccaagcgagt ggagggtgga cgccccttcc ctcaccccga 7980 
cacccgcatg tgctcagtgt ccgtgccgcc ggccctagtg cctgggctga acgcggggcc 8040 
gggactctga ggacgcctcc caggcgcgca gtccgtctgg ccaaggtgga gcgggacggc 8100 
ngcttccgac ggtgcgcggg tcggctcggg gttgcaggga catccggcgt ccgctcctgc 8160 
cctgttttcc tgccttcgca gagcgttgcg caactctagc tttaaacgcc cctgtccccc 8220 
tcaacttgtc tcccccagcc cctctgattt acagattctg cagtccccga gggttgcgcc 8280 
tacgataccg acactcgcgg cagcctgcga ggcgagtatg atcgtcccat ttttcggagt 8340 
agcaaactaa ggttcagaga ctactatgtc ccaggtcggt ctggtttgaa ggtccgcttt 8400 
cctctccctc cgccagcggg cggtgcgagg gactgggcga ggcagcgctt ccctaaggag 8460 
gcgacccgca gccccggccc cctcccgact ccgccccgtt gcagggcccg ggtcggcgag 8520 
gcctctcagc tctaagcccg acgggacttg gtgattgggc aggacggaag agctgggtgg 8580 
ggctttccac cagcggagaa agtctagtgg gcgtggtcgc gacgagggcg tggcctggtg 8640 
ccccgccccc gtccgcgcgc tcaaagtgga gggtggctgt gggggcgggg tcagaacact 8700 
ggcggccgat cccaacgagg ctccctggag cccgacgcag agcagcgccc tggccgggcc 8760 
aagcaggtat cgacgaccgc gcggggcgtc ttgggctgga ccaggcgggc gcccggggcc 8820 
tgctgaggac cacaaagggc actgggggtc gtggtccagg ctgtgcttcc tcccgctggc 8880 
cctggcccct gcctccgccc ccgcccccgc cttcctgccg ctaagccggc tgcggcgggg 8940 
ccgattggcg cctgccggct tcctgcgccg gggccagtct aatgcatggg gcccgggcgg 9000 
gggactaagg ggaaactgag tcacgtcggt gtgggagcag ttctgtgtgg gaggcaccac 9060 
cccccactgg gctcggggaa ggatccccct ccaagctatg cttgagggtc ccagccccca 9120 
tctgtctcca caggggccgc accccactcc cgccttcccc ttcttcagca cccaggggtc 9180 
ccgccctggc tcccagcagc ctcgactggt cccggaatgg ctaggaggat ccgctgcagc 9240 
cgcctccctc ccctcccctc ccctcccctc ccctcccctc ccctcccctc ccctcccctc 9300 
cccctcgcgt cccaagcccc cgtgtgctcc ctccgctggc tctccgcaca gtgtcagctt 9360 
acacgcctta tatagtccga gcaggctcca gccgcggcct gctgccggga cctgggggcg 9420 
ggggagagga gagccggccc ctgactcacc cggaccgccc gaggctccag gctggcttgg 9480 
ggggaggccg cgccagttta gtccctcggc ccacccctgg ttgcaaagaa cctcaagcct 9540 
ggattcaggc acccctcacc gttccagtcc caaggggagg ggggctgctc ctgtctttcc 9600 
aaagtgaggt ccgccagcca gcagcccagg ccagcctgac aaaatacctg cctcctatgg 9660 
cttgggcgtg ctcaggggct gcccgtgcct gcctggcccc tgtccaaggc tggtatcctg 9720 
agctggcccg gcctgcctgc ctgcccgccc accatgctgg ccactcacct tctcttctct 9780 
cctctcagga gccggcatca tggattcctt caaagtagtg ctggaggggc cagcaccttg 9840 
gggcttccgg ctgcaagggg gcaaggactt caatgtgccc ctctccattt cccgggtgag 9900 
cctaggtttg gggagggggc tcccccagcg gtctttcggt gcttaggtct ccagagggtg 9960 
atggggggag tcctaacagg agctggtcag gggccagcag gccaggagat gtctaggtcc 10020 
ggagatgtag tggtacctgc ctgccacaag gactcccaat gaggtggata ctgggaggga 10080 
gcacccaggc ttctccagcc ctgcactgta cccgatgctg ttctcccaag ctcctgtggc 10140 
cacctctgag ggctggaggg aggctcattg tgcaggatgg gagcctaaca tttcaggagg 10200 
tatctaaact tgaggtggca atgcttggag ccaggcccca ggcaggacac tgtgactata 10260 
ggatttcact tcagcctcac tgccgcccag ggaatagcaa tcctcatccc gtttttccag 10320 
atgagagaag aactcatgga gaggtggcgg ggctcgctca tcgagtccat ggtgaagcag 10380 
ggattggaat tgaggcacag catggcgtac attttttgtg ggtagaaggg gtctctcccc 10440 
agcctatgta aggacccaca tccactgttc ccattcagga tgtggtggcc tttgacccca 10500 
agcagaagtg taggacaggg ctccattcta ggggcttaac ttcagcttcc aagagcctgc 10560 
cctggtgtgg gtggagctgg aggctggctc ctccctgtag cagggggatt gccttataag 10620 
cccaagaatg cagccccacg ctgggatggc caacagtggc tgcggtctgc agagctgaaa 10680 
agggctggcc taggcctggc cccctgaacc ccactggtgg gcctctcagc tggtcaccag 10740 
gctgcagctc cagctgtatg gtccagttgt gagacacaac aaattgcctg cccagagtgg 10800 
gtgaggccag cctgtcggct ggcatctctg actggcctgg gggtcaggag ggggtgggga 10860 
cttcctgccc ctatatccgc ctgccccgag agacccaccc aggcgccggg tgggcaggca 10920 
gctgttgtca ggaagcccaa ggcaagccca gcctggaggg gcccagaggg tcgtggcctg 10980 
aggaggggct caagctggag tctgtctgta ggagctgggc gtgggggtta gggtgggcag 11040 
gccagcagtg ctcttctcag gggtcctttg atggcattct cctggaacct gccccgccag 11100 
cagggtagtg aggcagtggt tgccctatga cacacgtccc actacatagc cctcacacag 11160 
ccctgaaacc tacctgacgt cctgctccct gggaaagtgc tggcccagtg tgtctgggga 11220 
gcctgaacct cagtttcttc cctgatggag atgactttca gatatggcct gttgggggca 11280 
ctccgggctc cagctccctg gtcagcatcc ctggcatgtg ggcggggcca ctagctgatc 11340 
ccagccctgg agttggacct gggcccacat gggtgggtga ggtgggcttt tctgagttag 11400 
gccagccccc tccccctccc ctgaccccag aatggaggga ggtgggaggg gcaagggctg 11460 
gctgtgggcc caggcctggg agatgaggta acgtctggga ctggggggct gggctgctca 11520 
ggctgactca cccccacctc atgcagggtc cagccccctg gctttttccc tccttggtcc 11580 
ctctggcctt accctgcccc tggcttgagc ccctccctgc ctctctccag ccacccgccc 11640 
agcgctgtct tctgctctcc tgctgccctc cccacgctct gaacacccct catcctctgt 11700 
gcttcctgcc ctcctcactc tgggaaggga agccgtcccc gccccccacc ccctctccag 11760 
gagccagcta gctgcacccc aagaccccca cctcgggctc agcccacagc tcccaggagc 11820 
cagccctgtg ggcagggagt ggctgggcca ggtttccctt ctactgactc accatgacct 11880 
tgagtaagtc acttcccctc tggggtgtca cttccccata cacagtataa ggggttgatt 11940 
tagttggatt gaactaaagg tgagggagtg gctcagggtg tctccaggtg ggctgacccc 12000 
tcagttgggc ccccatgctc agcagaggtg gcccacagtg gtggagcctt agggtcagag 12060 
acacttcctg gctctgcctc ttactagctg ggtgacttga ggcaagttgt ttaacctctc 12120 
tgtgtacatt tgcaagtgca aaatgggtaa aatcccagat tactccacaa ggttgttgga 12180 
agattcagtg tcaatatgta gcatagttgg tgctcaataa actgaagcaa gtcttcttat 12240 
ttagcgagtg aggaaggggc cgccgagctc tcttagcctt ctgacctcct acgcaagcaa 12300 
gaggtcatgt tgagcccagc tcgcctttct tttcccagtg ctgtcaagct ctgtgcctgg 12360 
ctgccctgcc ctctgacatc tctctgaaac ctcttgcctc ccctctccct gcctcagctc 12420 
agtctgtgca ctgacccacc tgaggagcct cctggggcca ctggcagcct ggaccccccc 12480 
agatcccccc cacccagtga aattgtcttc cagcactgcc tcacaaaagc ctacttgatg 12540 
cagtgccagg cctcttgcca gatggctggg tggtccctta ggcttggacc cagtcaagct 12600 
gccctgcctg tgttgctggg gctgggctag aggcctggaa ggggtttatc agggtcaccc 12660 
tctcagggcc tgggagatac ccaatcccag acattaaaac tgccagtagc ccctctacct 12720 
tcaaagccaa gtcctggtcc cttcccctgg cattcaaagc catcgtaagt gaactctcac 12780 
ccgctaggca gcacacgcca ttctccttta ccgaggccca ccgcttcctc aaagtcattc 12840 
ctgatggtct cagctcatgc tggtggcagc catttctccc agcctactgt ctctactcat 12900 
tgccacagga accagggact cccagctcaa gagcctgaag gattggggtc aggggaaatt 12960 
ggcagtcgag ggcttgggag tgacagccat gtatggccta cgaagtccca gctgtcaact 13020 
taggtcccat tcaggcagtg ttcacaggga accgggagat aacagggcct gttcctggct 13080 
ctcaaagggt cccagcagac ccctatagat ggcccccgac agggtgctgg ggggtgagag 13140 
gtccataaga gcccccggtg gtttcgggga ggaagctgcc ccctgcatgg gccagagggc 13200 
atatctggta ggtggagtgg cctgggcagg aggccagcag gagcctcaaa aggcaatggt 13260 
cctcctgaaa cacttgggct ttagcctgag cgtggctgtt tgtggacatc atagcaattt 13320 
ctggactgtg ggggagggtg gtggcggtga atagataagc atcgtgactg gggaagctca 13380 
ggtgagcacc acctgaggga gagggtctgg cagtgaataa ataagcagtg tgactgggaa 13440 
attgtgaagc tcaggtgagc gccaccacct cctgggttgc tttagtgtcc agcagctgcc 13500 
tagaactatg ttgaatgaag agctctctgg gttctggaag tgggacagct ttgggtgggg 13560 
cagtgttacc accgtcagcc tggcttgggt ctgcagggtc cagggcctcg gtcactttgc 13620 
ttctctctcc acagctcact cctgggggca aagcggcgca ggccggagtg gccgtgggtg 13680 
actgggtgct gagcatcgat ggcgagaatg cgggtagcct cacacacatc gaagctcaga 13740 
acaagatccg ggcctgcggg gagcgcctca gcctgggcct cagcaggtat gcgggtggac 13800 
atggatgggt gcgcccgcgc tggcagtggg gatccctgcg gcccggcccg ctgtcacgct 13860 
ttccttctcc tccagggccc agccggttca gagcaaaccg cagaaggtac gaggctggcc 13920 
gggacatccg ggcggtgggc ggtgtgggct tggacggcca ggcctgctcg ccctcctggc 13980 
acattctcgg taccccaatc cctggccggg agtggagggc agaaaccgga gctaaggcgg 14040 
gtctagggcc ctggagttga gccaggggct gctgcacggt cctggcacca cgcatgtccg 14100 
cctgtctgtc cgcctgtctg tccgcctgct gcctcccgcc gccggcgctg cgtgctcgcc 14160 
cgcactcggt cagccctcgg tcctgcgtgg actgagatcg ccactcccaa atgggcccct 14220 
tgaaacctga gtcgtcctct ccccgtagcc tccaaataga tgtagggggt ggggtggggg 14280 
tggggggctg gagctgccgc tgtcctctgc tgcaggcgcc ccacttccac ccaggccccc 14340 
accttaccct gcccgcccgc cctgcccggc tgtgtctctg cccaggcctc cgcccccgcc 14400 
gcggaccctc cgcggtacac ctttgcaccc agcgtctccc tcaacaagac ggcccggcct 14460 
ttgggcgccc ccgcccgctg acagcgcccc gcagcagaat gggtacgtcg gcccctgccc 14520 
gcccgcgccc acgccatcag gcccactgtg gccccacgcc cgctgcccgc tgctgctcag 14580 
tctgtgctgc gccccagccc ggcggaaccg tgcggcacgc cccctggcgg ccggggtggg 14640 
gctgcaggca cagggcccct cccgaggctg tggcgccttg cagggcaccg cctggggagg 14700 
ggtctctgaa tgacgccgcg ccccctgctg gcggctgggg gttgggttgt ggtgtcgggc 14760 
cagctgagcc ccagacactc agtgccgcct tgtccccggc tgttctgacc cctccccgtc 14820 
tttcttcctc tcctgtgtct gtccctttgt ccctttatct gtctgtctgt cttatttcct 14880 
tcacaggtgc agacccctga caagtcagtg agcccccctc tgcctgtgcc tttcttcttc 14940 
cttttggcac tctgggtggc ggcccctccc caccctggct gccctcctct ccacttcgcc 15000 
ctcctgtcct ctcacctacc cgcccagcag ggctcctggc ctcaccctta cccactccct 15060 
cccatcactg taacccaaac ccacatgcac caaatcctgg gaggggctgc ccccaccgcc 15120 
cacccccagt gtggggttct gagccacacc ctccccacag acagccgctc cgaccgctgg 15180 
tcccagatgc cagcaagcag cggctgatgg agaacacaga ggactggcgg ccgcggccgg 15240 
ggacaggcca gtcgcgttcc ttccgcatcc ttgcccacct cacaggcacc gagttcagta 15300 
agtgccagcc cagggcaggg ggtactttcc tcgcccccag cccaggcgtg atccctgacc 15360 
ctgtgtcttt tttggtcaat gcctgcctct gctctctcag tgcaagaccc ggatgaggag 15420 
cacctgaaga aatcaaggta cagggacggg caccagcccc tctcccacct cctgcctctt 15480 
ccattccagc tactgccctg tgtctactcc tgaggctccc agctggggct ctcaattctc 15540 
ccttccttcc ttccttcctt ccttccttcc ttccttcctt ccttccttcc ttccttcctt 15600 
cccttcctcc ttccttcctt ctttcatttc ttccctccct ccttccttcc ctcctccctc 15660 
cctgcctccc ttccatctct ccttccttcc acttcttcct ccctctctct ctgcccctca 15720 
gggaaaagta tgtcctggag ctgcagagcc cacgctacac ccgcctccgg gactggcacc 15780 
accagcgctc tgcccacgtg ctcaacgtgc agtcgtagcc cggccctctc cagccggctg 15840 
ccctctctgc ctccctcttt ctgttcctcc tgcccagggc acccccttag tgcctccagc 15900 
ttctgcctac ctcacccccc ctttcgtgcc cctggcctga gcctcctgct ggcctggccc 15960 
tggccgccca cctgggttca tctgacactg ccttccctct ttgccctgtg gtactgctgt 16020 
ctgccaggtc tgtgctgcct tgggcatgga ataaacattc tcagccctgc ttgctctgcc 16080 
tgtcttctat ctttgtggac ctggtttgca tttggggtgt gggggtgttt cgtggttcgg 16140 
actgtttggg ccctgccgtc cttgttttca gtgggagggg gtacctggca aaggggccct 16200 
gccctgccat cacagatggc ttcctggcat gaggggagcc ccaggagctg cctcagaagc 16260 
gggagccctg cctcgtctcc cagctagaga ccgcacacca gctaactgga cattgctagg 16320 
agaagctgcc cttcccatcc ctaccccagt gggacctgga atccaactcg gcagtttcca 16380 
cgcccccagt catctcccgt ggggccagca ggacccaggt tggggggtgg ggccatgtca 16440 
ggaagctcag ccatgcaggg ccttgaatgg cagatcttgc agccaggtgc ccaggacaga 16500 
agccccagcc ccagcctcat ctacacccca ggagccctgg cctggtgaga gggagtgggc 16560 
tcgggcctgg gcaagggtgg gcagcctcca ggggcatggg ggtggtgggc ttctctcagc 16620 
tgcctggggc tccacccccg tcctttgggg tccctgggca cccctttaga gtcactttcc 16680 
ccggcaggcc ctaccgcccc cagccctacc agccgcccgc cctgggctgt ggaccctgcg 16740 
tttgccgagc gctatgcccc ggacaaaacg agcacagtgc tgacccggca cagccagccg 16800 
gccacgccca cgccgctgca gagccgcacc tccattgtgc aggcagctgc cggaggggtg 16860 
ccaggagggg gcagcaacaa cggcaagact cccgtgtgtc accagtgcca caaggtcatc 16920 
cggtgggtgg cctgttcctg tccgaccctg gctttcccat cctgcagccc agccccacct 16980 
gtctgcccac ctgtcttgcc tcagctgcga ctggggggaa taaggattca gttctcagct 17040 
ggagtaggag tagggacctg ggctgggtcc tcccattctt aatcccacgc tacctacccc 17100 
agcccaccca caacaactgc tagcagcatc tgccgtggcg aaatagccga agggccaacc 17160 
ataggctgaa gctgcacccc tacctttgct gctctctggg caaagagggg cctgccccct 17220 
cccagcgcgt ctgcccctcc ctcctgctct ctgtctccct ctgctctcag agcatacagg 17280 
cctggagcca ctccctctgt gcactgcccc gtggggccaa gcagcatcaa acacccccca 17340 
gcatcagcgt gccggattct agagccttcc taattcgcag gcctggcctg ctctcatctc 17400 
tgtcagctct tttttttttt tttttgaaac agagtctcac tgtgttgccc acgttggcgt 17460 
gcagtggcgc gatctcggct cactgcaacc tctgcctcct gggttcaaga gattctcctg 17520 
cctcagcctc ctgagtagct gggattacag gcacccgcca ccatgcctgg ctaattttgt 17580 
atttttagta gagacggggt tttaccatgt tggccaggct ggtctcaaac tcctcacctc 17640 
aggtgatctc aggcctgcct tggcctccca aagtgctggg actacaggtg tgagccactg 17700 
tgcccagccg actctatcag ctcttgccag gtagaacagg caggccagca ggacagggca 17760 
gctccagggt ttgcccaggg gcggctcagc ttttatgagg ctccagtcgt cagcccttcc 17820 
tcccggggtc ctccctgctc taaagctgcc tctcctgtca ccagcagttc agtgtggcgg 17880 
actggctctg taagcttcat ggctgccacg gtcacttccc aagcctgtct tctatcctat 17940 
gtggaaaatg gggagaatga actgtccctc ccaaggcctc ctggtgggtg gtcagtcaac 18000 
ctgaaggggg ccaagacccc cacctctctg cgtgtgctcc ctctgaccgc tctcgcctcc 18060 
ctgcaggggc cgctacctgg tggcgctggg ccacgcgtac cacccggagg agtttgtgtg 18120 
tagccagtgt gggaaggtcc tggaagaggg tggcttcttt gaggagaagg gcgccatctt 18180 
ctgcccacca tgctatgacg tgcgctatgc acccagctgt gccaagtgca agaagaagat 18240 
tacaggcgtg agtagggctg gctggcgggg aggtggtccc aagcctgtca gtgggaacga 18300 
gggctgctgg gaaacccaca gtccaggtct ctccccgagt gagcctccgg gtccttacca 18360 
gcgtaataaa tgggctgctg tactggcctc accctgcatt agtcaggatg ctcttaacaa 18420 
atgaccatgt tcctgctcag aaaccgccca aggctgcaaa gagcaggagg accaagccag 18480 
gagaagccct gggccctcct gactcccact ttgggctctc cctgccctgg tgaaatgaca 18540 
gaacggccaa cttgacacgc tgaagctgct ctgtctcatg cgtcctcctc atttctggat 18600 
ccagagccag ggctgccagg agtagccaga gagctctgtg tggtgatgtt catattagtg 18660 
aggtttacct tgaccacgag cagtgggaaa ctcaaaataa tggtggctta tttctcatct 18720 
aaaaacatcc cggggtgggt ggtctgggac tgatctggtg gacccaggct ccgccttgtt 18780 
gcttgactgt tggcagcacc tgcttactta ccactcatgg tgcaagatga cacttcagcc 18840 
tccgccaaaa tgctcacctt ccagccagca ggaagtcgga aggagaagaa aggggacaga 18900 
gccccatggc gtccatcctt agaggatgct gccacctgaa cctctgcttt catcctgttg 18960 
gtcagaaccc agtcacatga ccacacccag tggcaacgga ggctgggaaa tatagtcttt 19020 
attttgggca cccatgtgtc cagcaaaact gggggttcca tcagtcggca agaacgggag 19080 
agtggccgat gcagtggctg atgcttgtat cccagcactt tgggaggtcg aggtgggcag 19140 
atcacctgag gtcaggagtt caagaccagc ctggccaata tggtgaaacc ctgtctctac 19200 
taaaaataaa aaaattagct gggtgtgctg gcgcacctgt agtcccagct acttgggagg 19260 
ctgaggcagg agaatcgctt gatcttgaga ggtggaggtt gcagtgagcc aagattgtgc 19320 
cactgccttc cagcctggga gacagcaaaa aaaaaaaaaa aaaaaaaaaa aaaaagggcc 19380 
aggcacggtg gctcacacct gtaatcccag cactttggga ggccgagatg ggcggatcac 19440 
gaggtcagga gattgagacc atcctggcta acacggtgaa accccatctc tactaaaaat 19500 
acaaaaaaat tggccgggca tggtggagta gtcccagcta ctcgggaggc tgaggcagga 19560 
gaatggcgtg aacctgggag gcagagcttg cagtgagccg agatcgcgcc actgcactcc 19620 
agcctgggca acagagcgag actcttgtct caaaaagaaa aaaagaaaga gaaatctgcc 19680 
tcccagcctt gggctcctgc cctaccagcc cacacccctg gtagagcctc ctctcccacc 19740 
agctcaaagc ccaagttcct tcactgtgac cttgtctgct cctctaaaac aggcaacacc 19800 
agacagtgag aagagccagc cagacatggg cagaaaacct atttctgtga tctactggct 19860 
gtgtgagcag gggctagttg ctctctctgg gcctcactga agagaagggt ggcactatgc 19920 
tagggccggc acggttgcaa ggtagatgta agatggggta caggtgttgt ggagggcaga 19980 
aatgcaccat ccgaaggcta catgtccccc acacttatgt cttgcttggc ccacactgtt 20040 
tcattttaaa atcagtagca aacaatttaa aaaatcagaa gatttgcctg catgatgcag 20100 
tggctcatgc ctgtaatccc agcactttgg gaggccaagg tgggaggatt gcttgagccc 20160 
aggagttcaa gaccagcatg ggcaccatag caagacccct gtttctacaa aaaaaaaaaa 20220 
attagaaaat tagccaagtg tggtggcatg cacctgtggt cccagctact tgggaggcag 20280 
agggaaagtg agatctcctg ctttttattt ctttatgtat aatgataggg tcttgctctg 20340 
ttgcccaggc tggagtgcag tggcatgatc actgctcact gcagccttga tctcctgggc 20400 
tcagaggatc ctcccacctc agcctcccaa atagctagga ctagaggtgc ccaccagcat 20460 
gctcagcaga tttttaaatc tttttgtaga gatgaggttt tgctatgttg cccaggctgg 20520 
tctcgaactc ctggcctcga gcgatcctcc caccttggcc tcccaaagca ctgggattac 20580 
agacgtgagc cactgcgccc agcagatttc tctttaacac ctagatttca gcctgagcca 20640 
ggcaggcatt cctgaatgaa ccagtagtac tgctcccaga agaagaggtc ctcctccgtg 20700 
tgacacagtc cccacttggc ccttgcaggg attggatctg ggatccctgg atttaaactc 20760 
agggccatcc tcataacagc ctcacaaggc tgggattagc ttcccagttc acaagggaag 20820 
aaaccaagac ttgagaaggt caaggtctgg ccagacccac acatcttgga ccctcatacc 20880 
gcctcgaggc cccatgctgc cctctgcctg ctccagatgt gaatactgct ggccctggct 20940 
ggccccggct ggccccgagg gtcctaggga tgaacagccc agcccaggga gagctcagcc 21000 
ccttgtgcct ctgccccttc ccacctcctg cggaggccag tcgactcacc cacaaagggc 21060 
caggcactgt ggggatagat cagctaacaa aacagttgat gcttcctgcc cttctgggcc 21120 
ttacattttg gctggaagaa gaggggagag gcagactgta agcaataagc gcaataagta 21180 
ggttgcctgg aagtaatgtt agatcacgtt acggaaaaca ggaaagagca gagcgacaag 21240 
tgctggggtg cgtggtgcag ggaaggcagc tggctgctgc tggtgtggtc agagtgggcc 21300 
ctcatggaga agactgcatt cgagcagaaa cttgaagggg gtgaggggtg agcctagaga 21360 
tatctggggc agagcagtcc aggcagaggg gacagccggt gtcaagccca ggacaggagt 21420 
gtgcctggtg tgccagtttc aggcaagagg ccagtgtgca gaggcaaggt gagaacgcaa 21480 
gggagagcag tggcggagac gggtgggaac gaggtcagac ctgctggcct ccagcctctg 21540 
catggggctt ggctcttgct gggagcaatg ggaagcagta cacagtttca tgcaggggga 21600 
gaaggcctgt cttgggttgc aggggcacgc tgtggcagct gggatcagag agaggagctt 21660 
gtaggccagt tgttatgtgg tcccacgggc cagatggcca tggcttacct cacttcaggg 21720 
aggctgtgag aagcactcag aatcfeggatg tgccttgggg gfcgggcccca ctggatttcc 21780 
tggtggacct ggtgtggggt gtgagaggag ggtgtgtttg gctgcagcag acaggagaat 21840 
ggagttgcca tccgcgtgat ggggatggct gtgggaggag aggtttgggg tgagggaatc 21900 
aggaactgag tgctggacat ggcaagtctg aaggcgcagt ggtcgtccac tcagagacct 21960 
tggagttgga gatggaggtg tgggagtcct gaacagttag atgtagtgtt taccgcgaga 22020 
aggaacaggg cttgcggcca gccctcctgt gttcccgtga cccagggcag ggcaggaggg 22080 
gcctgagcct gccgagtgac tgggacctcc ttccaggaga tcatgcacgc cctgaagatg 22140 
acctggcacg tgcactgctt tacctgtgct gcctgcaaga cgcccatccg gaacagggcc 22200 
ttctacatgg aggagggcgt gccctattgc gagcgaggta cccactggcc agtgagggtg 22260 
aggagggatg gtgcatgggg caggcatgaa tccaggtcct ctttctctct gcccccattc 22320 
tcagactatg agaagatgtt tggcacgaaa tgccatggct gtgacttcaa gatcgacgct 22380 
ggggaccgct tcctggaggc cctgggcttc agctggcatg acacctgctt cgtctgtgcg 22440 
gtgagagccc cgcccctcga actgagcccc aagcccaccg gccctctgtt cattccccag 22500 
gagatgcagg agaagttggg aaggggcctc tcctgctgcc cccaacccca tgtgactggg 22560 
cctttgctgt ccttagatat gtcagatcaa cctggaagga aagaccttct acxccaagaa 22620 
ggacaggcct ctctgcaaga gccatgcctt ctctcatgtg tgagcccctt ctgcccacag 22680 
ctgccgcggt ggcccctagc ctgaggggcc tggagtcgtg gccctgcatt tctgggtagg 22740 
gctggcaatg gttgccttaa ccctggctcc tggcccgagc ctggggctcc ctgggccctg 22800 
ccccacccac cttatcctcc caccccactc cctccaccac cacagcacac cgatgctggc 22860 
cacaccagcc ccctttcacc tccagtgcca caataaacct gtacccagct gtgtcttgtg 22920 
tgcccttccc ctgtgcatcc ggaggggcag aatttgaggc acgtggcagg gtggagagta 22980 
agatggtttt cttgggctgg ccatctgggt ggtcctcgtg atgcagacat ggcgggctca 23040 
tggttagtgg aggaggtaca ggcgagaccc catgtgccag gcccggtgcc cacagacatg 23100 
aggggagcca ctggtctggc ctggcttgga ggttagagaa gggtagttag gaagggtagt 23160 
tagcatggtg gctcatgcct gtgatcccag cactttggaa ggccaaggtg ggcagatcgc 23220 
ttgaggtcag gagttcgaga cctcatggcc aacacggtga aacagcgtct ctagtaaaaa 23280 
tacaaaaatt agccgagtgt ggtggggcat gcctgtaatc ccagccactc aggaggctga 23340 
ggcgggaaaa tcacttgaac ctgggaagtg gaggttgcag tgagctgaga tcacaccact 23400 
gcgcgcgagc ctgggtggca gatggcagag cgagaccctg cttcaaaaaa aaaaaaaaaa 23460 
aaaaaaaaaa gaagggtagt tgtagttggg ggtggatctg cagagatatg gtgtggaaaa 23520 
cagcaatggc cacagcaaag tcctggaggg gccagctgcc gtccaaacag aagaaggcag 23580 
ggctggagag ggtagccctt aggtcctggg aagccacgag tgccaggcag tagagctggg 23640 
gctgtctctt gaggttaggg cagggcaagg cacagcagag tttgaaatag gtttgtgttg 23700 
tattgcagaa aagaggcccc agaacactga gggagtgcag gagggaggct gggaggagga 23760 

gttgcagcag ggcctagggg cgggggccag gcaagggagg ggcagagagt aatatggcag 23820 
agatgggacc cagtggcagg tccgggggat gagggatgga gagaaggaca ggagcgttgc 23880 
caggcatctg gcctatacca gacatgctca cgctgtctcc cgcgaacctc ctagcaacct 23940 
tgcgccgttg tctgcaatca cttatttcat tttttctttt ttaactttaa ttttttttgt 24000 
ttttaagaga caggatctcc ctaggttgcc cgggctggtt tcaaactcct gggctcaagc 24060 
aattcttcct ccttagcccc aaagtgctgg cattacaggt gtgagccacc atgcctggcc 24120 
cacttatttt ctagatgagg cacagaaaga ttgggagact tgaccaaggt cacgctgtca 24180 
ttgagccatg agccagacta gaatccaggc ctgaagctgg gtgcgctgtc ccaggactgg 24240 
ctggcactga gtaccatttg ccagcgagca tctctctggg aagctgactt ctgcccggta 24300 
cctggaggac tgtagacctt ggtggtggcg ccgtcactct ggggcttcct gcctcccact 24360 
gatgcccgca ccaccctaga gggactgtca tctctcctgt cccaagcctg gactggaaag 24420 
actgaagaga agccttaagt aggccaggac agctcagtgt gccatggctg cccgtccttc 24480 
agtggtccct ggcatgagga cctgcaacac atctgttagt cttctcaaca ggcccttggc 24540 
ccggtcccct ttaagagacg agaagggctg ggcacggtga ctcacacctc taatcccagc 24600 
actttggaag gctgaggctg gagaagggct ccagcttagg agttcaggac cagcctgggc 24660 
aacatggtga gaccctgttt tgttttgttt tttgtttttt tgagatggag tcttgctctg 24720 
tcgcccaggc tggagtgcag 24740 

<210> 42 
<211> 25 
<212> DNA 
<213> Rattus norvegicus 
<400> 42 

gcactacctt gaaggaatcc atggt 25 

LIM Mineralization Protein Splice Variants 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The field of the invention relates generally to osteogenic cells and the 
formation of bone and boney tissue in mammalian species. Specifically, the 
invention concerns a novel family of proteins, and nucleic acids encoding those 
proteins, that enhances the efficacy of bone mineralization in vitro and in vivo. 
The invention provides methods for treating a variety of pathological conditions 
associated with bone and boney tissue, such as, for example, spine fusion, 
fracture repair and osteoporosis. 

2. Description of the Related Art 

Osteoblasts are thought to differentiate from pluripotent mesenchymal 
stem cells. The maturation of an osteoblast results in the secretion of an 
extracellular matrix which can mineralize and form bone. The regulation of this 
complex process is not well understood but is thought to involve a group of 
signaling glycoproteins known as bone morphogenetic proteins (BMPs). These 
proteins have been shown to be involved with embryonic dorsal-ventral 
patterning, limb bud development, and fracture repair in adult animals. B. L. 
Hogan, Genes & Develop., 10:1580 (1996). This group of transforming growth 
factor-beta superfamily secreted proteins has a spectrum of activities in a 
variety of cell types at different stages of differentiation; differences in 
physiological activity between these closely related molecules have not been 
clarified. D. M. Kingsley, Trends Genet., 10:16 (1994). 

To better discern the unique physiological role of different BMP signaling 
proteins, we recently compared the potency of BMP-6 with that of BMP-2 and 
BMP-4, for inducing rat calvarial osteoblast differentiation. Boden et al., 
Endocrinology, 137:3401 (1996). We studied this process in first passage 
(secondary) cultures of fetal rat calvaria that require BMP or glucocorticoid for 
initiation of differentiation. In this model of membranous bone formation, 
glucocorticoid (GC) or a BMP will initiate differentiation to mineralized bone 
nodules capable of secreting osteocalcin, the osteoblast-specific protein. This 
secondary culture system is distinct from primary rat osteoblast cultures which 
undergo spontaneous differentiation. In this secondary system, glucocorticoid 
resulted in a ten-fold induction of BMP-6 mRNA and protein expression which 
was responsible for the enhancement of osteoblast differentiation. Boden et 
al., Endocrinology, 138:2920 (1997). 

In addition to extracellular signals, such as the BMPs, intracellular 
signals or regulatory molecules may also play a role in the cascade of events 
leading to formation of new bone. One broad class of intracellular regulatory 
molecules are the LIM proteins, which are so named because they possess a 
characteristic structural motif known as the LIM domain. The LIM domain is a 
cysteine-rich structural motif composed of two special zinc fingers that are 
joined by a 2-amino acid spacer. Some proteins have only LIM domains, while 
others contain a variety of additional functional domains. LIM proteins form a 
diverse group, which includes transcription factors and cytoskeletal proteins. 
The primary role of LIM domains appears to be in mediating protein-protein 
interactions, through the formation of dimers with identical or different LIM 
domains, or by binding distinct proteins. 

In LIM homeodomain proteins, that is, proteins having both LIM domains 
and a homeodomain sequence, the LIM domains function as negative 
regulatory elements. LIM homeodomain proteins are involved in the control of 
cell lineage determination and the regulation of differentiation, although 
LIM-only proteins may have similar roles. LIM-only proteins are also implicated in 
the control of cell proliferation since several genes encoding such proteins are 
associated with oncogenic chromosome translocations. 

Humans and other mammalian species are prone to diseases or injuries 
that require the processes of bone repair and/or regeneration. For example, 
treatment of fractures would be improved by new treatment regimens that 
could stimulate the natural bone repair mechanisms, thereby reducing the time 
required for the fractured bone to heal. In another example, individuals 
afflicted with systemic bone disorders, such as osteoporosis, would benefit 
from treatment regimens that would result in systemic formation of new bone. 
Such treatment regimens would reduce the incidence of fractures arising from 
the loss of bone mass that is a characteristic of this disease. 

For at least these reasons, extracellular factors, such as the BMPs, 
have been investigated for the purpose of using them to stimulate formation of 
new bone in vivo. Despite the early successes achieved with BMPs and other 
extracellular signalling molecules, their use entails a number of disadvantages. 
For example, relatively large doses of purified BMPs are required to enhance 
the production of new bone, thereby increasing the expense of such treatment 
methods. Furthermore, extracellular proteins are susceptible to degradation 
following their introduction into a host animal. In addition, because they are 
typically immunogenic, the possibility of stimulating an immune response to the 
administered proteins is ever present. 

Due to such concerns, it would be desirable to have available treatment 
regimens that use an intracellular signalling molecule to induce new bone 
formation. Advances in the field of gene therapy now make it possible to 
introduce into osteogenic precursor cells, that is, cells involved in bone 
formation, or peripheral blood leukocytes, nucleotide fragments encoding 
intracellular signals that form part of the bone formation process. Gene 
therapy for bone formation offers a number of potential advantages: (1) lower 
production costs; (2) greater efficacy, compared to extracellular treatment 
regimens, due to the ability to achieve prolonged expression of the 
intracellular signal; (3) it would by-pass the possibility that treatment with 
extracellular signals might be hampered due to the presence of limiting 
numbers of receptors for those signals; (4) it permits the delivery of transfected 
potential osteoprogenitor cells directly to the site where localized bone 
formation is required; and (5) it would permit systemic bone formation, thereby 
providing a treatment regimen for osteoporosis and other metabolic bone 
diseases. 

SUMMARY OF THE INVENTION 

The present invention seeks to overcome the drawbacks in the prior art 
by providing novel compositions and methods for inducing bone formation 
using an intracellular signalling molecule that participates early in the cascade 
of events that leads to bone formation. Applicants have discovered 
10-4/RLMP (SEQ ID NO: 1, SEQ ID NO: 2), a novel LIM gene with a sequence 
originally isolated from stimulated rat calvarial osteoblast cultures. The gene 
has been cloned, sequenced and assayed for its ability to enhance the efficacy 
of bone mineralization in vitro. The protein RLMP affects mineralization of 
bone matrix as well as differentiation of cells into the osteoblast lineage. Unlike 
other known cytokines, for example, BMPs, RLMP is not a secreted protein, 
but is instead an intracellular signaling molecule. This feature has the 
advantage of providing intracellular signaling amplification as well as easier 
assessment of transfected cells. It is also suitable for more efficient and 
specific in vivo applications. Suitable clinical applications include enhancement 
of bone repair in fractures, bone defects, bone grafting, and normal 
homeostasis in patients presenting with osteoporosis. 

Applicants have also cloned, sequenced and deduced the amino acid 
sequence of a corresponding human protein, named human LMP-1. The 
human protein demonstrates enhanced efficacy of bone mineralization in vitro 
and in vivo. 

In addition, the applicants have characterized a truncated (short) version 
of LMP-1, termed HLMP-1s. This short version resulted from a point mutation 
in one source of a cDNA clone, providing a stop codon which truncated the 
protein. The short version (LMP-1s) is fully functional when expressed in cell 
culture and in vivo. 

Using PCR analysis of a human heart cDNA library, Applicants have 
identified two alternative splice variants (referred to as HLMP-2 and HLMP-3) 
that differ from HLMP-1 in a region between base pairs 325 and 444 in the 
nucleotide sequence encoding HLMP-1. The HLMP-2 sequence has a 119 
base pair deletion and an insertion of 17 base pairs in this region. Compared 
to HLMP-1, the nucleotide sequence encoding HLMP-3 has no deletions, but it 
does have the same 17 base pairs as HLMP-2, which are inserted at position 
444 in the HLMP-1 sequence. 
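
The relationship among the three coding sequences described above can be illustrated with a short sketch. The reference sequence and the 17-base insert below are placeholders, not the sequences of the invention. Positions are assumed to be 1-based, and placing the 119 base pair deletion flush against position 444 is an assumption made only for illustration.

```python
# Illustrative only: placeholder sequences, not HLMP-1 or the real 17-bp insert.
REGION_END = 444     # last base of the variable region (1-based, from the text)
DELETION_LEN = 119   # bases deleted in HLMP-2 (from the text)
INSERT_LEN = 17      # bases inserted in HLMP-2 and HLMP-3 (from the text)


def make_hlmp3_like(reference: str, insert: str) -> str:
    """Insert `insert` immediately after position REGION_END; nothing is deleted."""
    return reference[:REGION_END] + insert + reference[REGION_END:]


def make_hlmp2_like(reference: str, insert: str) -> str:
    """Delete DELETION_LEN bases ending at REGION_END, then add `insert` there.

    Where exactly the deletion sits inside the region is an assumption.
    """
    cut_start = REGION_END - DELETION_LEN  # 0-based index of first deleted base
    return reference[:cut_start] + insert + reference[REGION_END:]


reference = "acgt" * 150       # 600-base placeholder standing in for HLMP-1
insert = "n" * INSERT_LEN      # 17-base placeholder insert

hlmp2_like = make_hlmp2_like(reference, insert)
hlmp3_like = make_hlmp3_like(reference, insert)

print(len(reference), len(hlmp2_like), len(hlmp3_like))
```

Under these assumptions the HLMP-2-like sequence is 102 bases shorter than the reference (119 removed, 17 added), while the HLMP-3-like sequence is 17 bases longer.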

Additional features and advantages of the invention will be set forth in 
the description which follows, and in part will be apparent from the description, 
or may be learned by practice of the invention. The objectives and other 
advantages of the invention will be realized and attained by the methods and 
compositions of matter particularly pointed out in the written description and 
claims hereof. 

In one broad aspect, the invention relates to an isolated nucleic acid 
molecule comprising a nucleic acid sequence encoding any LIM mineralization 
protein, wherein the nucleic acid molecule hybridizes under standard 
conditions to a nucleic acid molecule complementary to the full length of SEQ. 
ID NO: 25, and wherein the molecule hybridizes under highly stringent 
conditions to a nucleic acid molecule complementary to the full length of SEQ. 
ID NO: 26. In a specific aspect, the isolated nucleic acid molecule encodes 
HLMP-1, HLMP-1s, RLMP, HLMP-2, or HLMP-3. In addition, the invention is 
directed to vectors comprising these nucleic acid molecules, as well as host 
cells comprising the vectors. In another specific aspect, the invention relates to 
the proteins themselves. 

In a second broad aspect, the invention relates to an antibody that is 
specific for LIM mineralization protein, including HLMP-1, HLMP-1s, RLMP, 
HLMP-2, and HLMP-3. In one specific aspect, the antibody is a polyclonal 
antibody. In another specific aspect, the antibody is a monoclonal antibody. 

In a third broad aspect, the invention relates to a method of inducing bone 
formation wherein osteogenic precursor cells are transfected with an isolated 
nucleic acid molecule comprising a nucleotide sequence encoding LIM 
mineralization protein. In one specific aspect, the isolated nucleic acid 
molecule is in a vector, which may be a plasmid or a virus, such as adenovirus 
or retrovirus. The transfection may occur ex vivo or in vivo by direct injection of 
the isolated nucleic acid molecule. The transfected isolated nucleic acid 
molecule may encode HLMP-1, HLMP-1s, RLMP, HLMP-2, or HLMP-3. 

In a further aspect, the invention relates to methods of fusing a spine by 
transfecting osteogenic precursor cells with an isolated nucleic acid molecule 
having a nucleotide sequence encoding LIM mineralization protein, admixing 
the transfected osteogenic precursor cells with a matrix and contacting the 
matrix with the spine. 

In yet another aspect, the invention relates to methods for inducing 
systemic bone formation by stable transfection of host cells with the vectors of 
the invention. 

It is to be understood that both the foregoing general description and the 
following detailed description are exemplary and explanatory and are intended 
to provide further explanation of the invention as claimed. 

ABBREVIATIONS AND DEFINITIONS 

BMP        Bone Morphogenetic Protein 
HLMP-1     Human LMP-1, also designated as Human LIM Protein or HLMP 
HLMP-1s    Human LMP-1 Short (truncated) protein 
HLMPU      Human LIM Protein Unique Region 
LMP        LIM mineralization protein 
MEM        Minimal essential medium 
Trm        Triamcinolone 
β-GlyP     Beta-glycerolphosphate 
RACE       Rapid Amplification of cDNA Ends 
RLMP       Rat LIM mineralization protein, also designated as RLMP-1 
RLMPU      Rat LIM Protein Unique Region 
RNAsin     RNase inhibitor 
ROB        Rat Osteoblast 
10-4       Clone containing cDNA sequence for RLMP (SEQ ID NO: 2) 
UTR        Untranslated Region 
HLMP-2     Human LMP Splice Variant 2 
HLMP-3     Human LMP Splice Variant 3 



DETAILED DESCRIPTION OF THE INVENTION 



The present invention relates to novel mammalian LIM proteins, herein 
designated LIM mineralization proteins, or LMP. The invention relates more 
particularly to human LMP, known as HLMP or HLMP-1, or alternative splice 
variants of human LMP, which are known as HLMP-2 or HLMP-3. The 
applicants have discovered that these proteins enhance bone mineralization in 
mammalian cells grown in vitro. When produced in mammals, LMP also 
induces bone formation in vivo. 

Ex vivo transfection of bone marrow cells, osteogenic precursor cells, 
peripheral blood leukocytes, or mesenchymal stem cells with nucleic acid that 
encodes LMP or HLMP, followed by reimplantation of the transfected cells in 
the donor, is suitable for treating a variety of bone-related disorders or injuries. 
For example, one can use this method to: augment long bone fracture repair; 
generate bone in segmental defects; provide a bone graft substitute for 
fractures; facilitate tumor reconstruction or spine fusion; and provide a local 
treatment (by injection) for weak or osteoporotic bone, such as in osteoporosis 
of the hip, vertebrae, or wrist. Transfection with LMP or HLMP-encoding 
nucleic acid is also useful in: the percutaneous injection of transfected marrow 
cells to accelerate the repair of fractured long bones; treatment of delayed 
union or non-unions of long bone fractures or pseudoarthrosis of spine fusions; 
and for inducing new bone formation in avascular necrosis of the hip or knee. 

In addition to ex vivo-based methods of gene therapy, transfection of a 
recombinant DNA vector comprising a nucleic acid sequence that encodes 
LMP or HLMP can be accomplished in vivo. When a DNA fragment that 
encodes LMP or HLMP is inserted into an appropriate viral vector, for example, 
an adenovirus vector, the viral construct can be injected directly into a body 
site where endochondral bone formation is desired. By using a direct, 
percutaneous injection to introduce the LMP or HLMP sequence, stimulation of 
bone formation can be accomplished without the need for surgical intervention 
either to obtain bone marrow cells (to transfect ex vivo) or to reimplant them 
into the patient at the site where new bone is required. Alden et al., 
Neurosurgical Focus (1998), have demonstrated the utility of a direct injection 
method of gene therapy using a cDNA that encodes BMP-2, which was cloned 
into an adenovirus vector. 

It is also possible to carry out in vivo gene therapy by directly injecting 
into an appropriate body site, a naked, that is, unencapsulated, recombinant 
plasmid comprising a nucleic acid sequence that encodes HLMP. In this 
embodiment of the invention, transfection occurs when the naked plasmid DNA 
is taken up, or internalized, by the appropriate target cells, which have been 
described. As in the case of in vivo gene therapy using a viral construct, direct 
injection of naked plasmid DNA offers the advantage that little or no surgical 
intervention is required. Direct gene therapy, using naked plasmid DNA that 
encodes the endothelial cell mitogen VEGF (vascular endothelial growth 
factor), has been successfully demonstrated in human patients. Baumgartner 
et al., Circulation, 97(12):1114-23 (1998). 

By using an adenovirus vector to deliver LMP into osteogenic cells, 
transient expression of LMP is achieved. This occurs because adenovirus 
does not incorporate into the genome of target cells that are transfected. 
Transient expression of LMP, that is, expression that occurs during the lifetime 
of the transfected target cells, is sufficient to achieve the objects of the 
invention. Stable expression of LMP, however, can occur when a vector that 
incorporates into the genome of the target cell is used as a delivery vehicle. 
Retrovirus-based vectors, for example, are suitable for this purpose. 

Stable expression of LMP is particularly useful for treating various 
systemic bone-related disorders, such as osteoporosis and osteogenesis 
imperfecta. For this embodiment of the invention, in addition to using a vector 
that integrates into the genome of the target cell to deliver an LMP-encoding 
nucleotide sequence into target cells, LMP expression is placed under the 
control of a regulatable promoter. For example, a promoter that is turned on by 
exposure to an exogenous inducing agent, such as tetracycline, is suitable. 
Using this approach, one can stimulate formation of new bone on a systemic 
basis by administering an effective amount of the exogenous inducing agent. 
Once a sufficient quantity of bone mass is achieved, administration of the 
exogenous inducing agent is discontinued. This process may be repeated as 
needed to replace bone mass lost, for example, as a consequence of 
osteoporosis. 

Antibodies specific for HLMP are particularly suitable for use in methods 
for assaying the osteoinductive, that is, bone-forming, potential of patient cells. 
In this way one can identify patients at risk for slow or poor healing of bone 
repair. Also, HLMP-specific antibodies are suitable for use in marker assays to 
identify risk factors in bone degenerative diseases, such as, for example, 
osteoporosis. 

Following well known and conventional methods, the genes of the 
present invention are prepared by ligation of nucleic acid segments that 
encode LMP to other nucleic acid sequences, such as cloning and/or 
expression vectors. Methods needed to construct and analyze these 
recombinant vectors, for example, restriction endonuclease digests, cloning 
protocols, mutagenesis, organic synthesis of oligonucleotides and DNA 
sequencing, have been described. For DNA sequencing, the 
dideoxy terminator method is preferred. 

Many treatises on recombinant DNA methods have been published, 
including Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Press, 2nd edition (1988); Davis et al., Basic Methods in 
Molecular Biology, Elsevier (1986); and Ausubel et al., Current Protocols in 
Molecular Biology, Wiley Interscience (1988). These reference manuals are 
specifically incorporated by reference herein. 

Primer-directed amplification of DNA or cDNA is a common step in the 
expression of the genes of this invention. It is typically performed by the 
polymerase chain reaction (PCR). PCR is described in U.S. Patent No. 
4,800,159 to Mullis et al. and other published sources. The basic principle of 
PCR is the exponential replication of a DNA sequence by successive cycles of 
primer extension. The extension product of one primer, when hybridized to 
another primer, becomes a template for the synthesis of another nucleic acid 
molecule. The primer-template complexes act as substrate for DNA 
polymerase, which in performing its replication function, extends the primers. 
The conventional enzyme for PCR applications is the thermostable DNA 
polymerase isolated from Thermus aquaticus, or Taq DNA polymerase. 
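
The exponential character of the replication described above can be made concrete with a small calculation. The sketch below is illustrative only: the function name and the per-cycle `efficiency` parameter are assumptions introduced here, and the idealized model (every template duplicated each cycle when efficiency is 1.0) ignores plateau effects seen in real reactions.

```python
def pcr_copies(initial_copies: float, cycles: int, efficiency: float = 1.0) -> float:
    """Expected number of template copies after `cycles` rounds of primer extension.

    efficiency = 1.0 is the ideal case in which every template is copied
    once per cycle, so the population doubles each cycle.
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles


# Ideal doubling from a single template molecule.
for n in (10, 20, 30):
    print(n, pcr_copies(1, n))
```

Lowering `efficiency` below 1.0 shows how strongly the final yield depends on per-cycle performance, since the shortfall compounds over every cycle.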

Numerous variations of the basic PCR method exist, and a particular 
procedure of choice in any given step needed to construct the recombinant 
vectors of this invention is readily performed by a skilled artisan. For example, 
to measure cellular expression of 10-4/RLMP, RNA is extracted and reverse 
transcribed under standard and well known procedures. The resulting cDNA is 
then analyzed for the appropriate mRNA sequence by PCR. 

The gene encoding the LIM mineralization protein is expressed in an 
expression vector in a recombinant expression system. Of course, the 
constructed sequence need not be the same as the original, or its 
complementary sequence, but instead may be any sequence determined by the 
degeneracy of the DNA code that nonetheless expresses an LMP having bone 
forming activity. Conservative amino acid substitutions, or other modifications, 
such as the occurrence of an amino-terminal methionine residue, may also be 
employed. 
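
The degeneracy referred to above means that different nucleotide sequences can encode the same amino acid sequence. The following sketch illustrates this with a deliberately small excerpt of the standard genetic code; the two example sequences are hypothetical and unrelated to LMP.

```python
# Excerpt of the standard genetic code (DNA codons, lower case).
# Only the codons needed for this illustration are included.
CODON_TABLE = {
    "atg": "M",
    "tta": "L", "ttg": "L", "ctt": "L", "ctc": "L", "cta": "L", "ctg": "L",
    "tct": "S", "tcc": "S", "tca": "S", "tcg": "S", "agt": "S", "agc": "S",
    "taa": "*", "tag": "*", "tga": "*",
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at a stop codon."""
    dna = dna.lower()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Two hypothetical sequences that differ at the nucleotide level.
seq_a = "atgctgtcctaa"
seq_b = "atgttaagctga"

print(translate(seq_a), translate(seq_b))
```

Both sequences translate to the same three-residue peptide even though they differ at several nucleotide positions, which is the property that permits a constructed sequence to differ from the original.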

A ribosome binding site active in the host expression system of choice is 
ligated to the 5' end of the chimeric LMP coding sequence, forming a synthetic 
gene. The synthetic gene can be inserted into any one of a large variety of 
vectors for expression by ligating to an appropriately linearized plasmid. A 
regulatable promoter, for example, the E. coli lac promoter, is also suitable for 
the expression of the chimeric coding sequences. Other suitable regulatable 
promoters include trp, tac, recA, T7 and lambda promoters. 

DNA encoding LMP is transfected into recipient cells by one of several 
standard published procedures, for example, calcium phosphate precipitation, 
DEAE-Dextran, electroporation or protoplast fusion, to form stable 
transformants. Calcium phosphate precipitation is preferred, particularly when 
performed as follows. 

DNAs are coprecipitated with calcium phosphate according to the 
method of Graham and Van Der Eb, Virology, 52:456 (1973), before transfer into 
cells. An aliquot of 40-50 µg of DNA, with salmon sperm or calf thymus DNA as 
a carrier, is used for 0.5 x 10^6 cells plated on a 100 mm dish. The DNA is mixed 
with 0.5 ml of 2X Hepes solution (280 mM NaCl, 50 mM Hepes and 1.5 mM 
Na2HPO4, pH 7.0), to which an equal volume of 2X CaCl2 (250 mM CaCl2 and 
10 mM Hepes, pH 7.0) is added. A white granular precipitate, appearing after 
30-40 minutes, is evenly distributed dropwise on the cells, which are allowed to 
incubate for 4-16 hours at 37°C. The medium is removed and the cells 
shocked with 15% glycerol in PBS for 3 minutes. After removing the glycerol, 
the cells are fed with Dulbecco's Minimal Essential Medium (DMEM) containing 
10% fetal bovine serum. 
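
The final solute concentrations in the precipitation mixture follow from simple dilution arithmetic. The sketch below is a worked example, assuming that the volume contributed by the DNA itself is negligible and that exactly 0.5 ml of each 2X solution is combined; the helper name `mix` is introduced here for illustration.

```python
# Stock compositions in mM, taken from the protocol above.
HEPES_2X = {"NaCl": 280.0, "Hepes": 50.0, "Na2HPO4": 1.5}
CACL2_2X = {"CaCl2": 250.0, "Hepes": 10.0}


def mix(solutions):
    """Combine (volume_ml, {solute: mM}) pairs and return final mM per solute."""
    total_volume = sum(volume for volume, _ in solutions)
    final = {}
    for volume, solutes in solutions:
        for name, conc in solutes.items():
            # Each solute is diluted by the ratio of its source volume to the total.
            final[name] = final.get(name, 0.0) + conc * volume / total_volume
    return final


# Equal volumes (0.5 ml each); DNA volume assumed negligible.
final = mix([(0.5, HEPES_2X), (0.5, CACL2_2X)])
for name in sorted(final):
    print(f"{name}: {final[name]:.2f} mM")
```

With equal volumes, each solute unique to one stock is halved, and Hepes, which is present in both stocks, ends up at the average of the two stock concentrations.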

DNA can also be transfected using: the DEAE-Dextran methods of 
Kimura et al., Virology, 49:394 (1972) and Sompayrac et al., Proc. Natl. Acad. 
Sci. USA, 78:7575 (1981); the electroporation method of Potter, Proc. Natl. 
Acad. Sci. USA, 81:7161 (1984); and the protoplast fusion method of 
Sandri-Goldin et al., Molec. Cell. Biol., 1:743 (1981). 

Phosphoramidite chemistry in solid phase is the preferred method for 
the organic synthesis of oligodeoxynucleotides and polydeoxynucleotides. In 
addition, many other organic synthesis methods are available. Those methods 
are readily adapted by those skilled in the art to the particular sequences of the 
invention. 

The present invention also includes nucleic acid molecules that 
hybridize under standard conditions to any of the nucleic acid sequences 
encoding the LIM mineralization proteins of the invention. "Standard 
hybridization conditions" will vary with the size of the probe, the background 
and the concentration of the nucleic acid reagents, as well as the type of 
hybridization, for example, in situ, Southern blot, or hybridization of DNA-RNA 
hybrids (Northern blot). The determination of "standard hybridization 
conditions" is within the level of skill in the art. For example, see U.S. Patent 
5,580,775 to Fremeau et al., herein incorporated by reference for this purpose. 
See also, Southern, E. M., J. Mol. Biol., 98:503 (1975), Alwine et al., Meth. 
Enzymol., 68:220 (1979), and Sambrook et al., Molecular Cloning: A 
Laboratory Manual, 2nd edition, pp. 7.19-7.50, Cold Spring Harbor Press 
(1989). 

One preferred set of standard hybridization conditions involves a blot that 
is prehybridized at 42°C for 2 hours in 50% formamide, 5X SSPE (150 mM 
NaCl, 10 mM NaH2PO4 [pH 7.4], 1 mM EDTA [pH 8.0]), 5X Denhardt's solution 
(20 mg Ficoll, 20 mg polyvinylpyrrolidone and 20 mg BSA per 100 ml water), 
10% dextran sulphate, 1% SDS and 100 µg/ml salmon sperm DNA. A 32P- 
labelled cDNA probe is added, and hybridization is continued for 14 hours. 
Afterward, the blot is washed twice with 2X SSPE, 0.1% SDS for 20 minutes at 
22°C, followed by a 1 hour wash at 65°C in 0.1X SSPE, 0.1% SDS. The blot is 
then dried and exposed to x-ray film for 5 days in the presence of an 
intensifying screen. 
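
The change in salt concentration across the hybridization and wash steps can be tabulated with a short calculation. This sketch assumes that the composition given in parentheses above (150 mM NaCl, 10 mM NaH2PO4, 1 mM EDTA) describes 1X SSPE; the step labels are illustrative names introduced here, not terms from the specification.

```python
# Assumed 1X SSPE composition in mM (see the assumption stated above).
SSPE_1X = {"NaCl": 150.0, "NaH2PO4": 10.0, "EDTA": 1.0}


def sspe(strength: float) -> dict:
    """Return solute concentrations (mM) for SSPE at the given X strength."""
    return {name: conc * strength for name, conc in SSPE_1X.items()}


# (illustrative label, SSPE strength, temperature in degrees C)
steps = [
    ("prehybridization and hybridization", 5.0, 42),
    ("first washes", 2.0, 22),
    ("final wash", 0.1, 65),
]

for label, strength, temperature in steps:
    nacl = sspe(strength)["NaCl"]
    print(f"{label}: {strength}X SSPE, {temperature} C, NaCl about {nacl:.0f} mM")
```

The final wash combines the lowest salt concentration with the highest temperature, which is the general pattern by which wash stringency is increased.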

Under "highly stringent conditions," a probe will hybridize to its target 
sequence if those two sequences are substantially identical. As in the case of 
standard hybridization conditions, one of skill in the art can, given the level of 
skill in the art and the nature of the particular experiment, determine the 
conditions under which only substantially identical sequences will hybridize. 

Another aspect of the invention includes the proteins encoded by the 
nucleic acid sequences. In still another embodiment, the invention relates to 
the identification of such proteins based on anti-LMP antibodies. In this 
embodiment, protein samples are prepared for Western blot analysis by lysing 
cells and separating the proteins by SDS-PAGE. The proteins are transferred 
to nitrocellulose by electroblotting as described by Ausubel et al., Current 
Protocols in Molecular Biology, John Wiley and Sons (1987). After blocking 
the filter with instant nonfat dry milk (1 gm in 100 ml PBS), anti-LMP antibody is 
added to the filter and incubated for 1 hour at room temperature. The filter is 
washed thoroughly with phosphate buffered saline (PBS) and incubated with 
horseradish peroxidase (HRPO)-antibody conjugate for 1 hour at room 
temperature. The filter is again washed thoroughly with PBS and the antigen 
bands are identified by adding diaminobenzidine (DAB). 

Monospecific antibodies are the reagent of choice in the present 
invention, and are specifically used to analyze patient cells for specific 
characteristics associated with the expression of LMP. "Monospecific 
antibody" as used herein is defined as a single antibody species or multiple 
antibody species with homogeneous binding characteristics for LMP. 
"Homogeneous binding" as used herein refers to the ability of the antibody 
species to bind to a specific antigen or epitope, such as those associated with 
LMP, as described above. Monospecific antibodies to LMP are purified from 
mammalian antisera containing antibodies reactive against LMP or are 
prepared as monoclonal antibodies reactive with LMP using the technique of 
Kohler and Milstein, Nature, 256:495-97 (1975). The LMP specific antibodies 
are raised by immunizing animals such as, for example, mice, rats, guinea 
pigs, rabbits, goats or horses, with an appropriate concentration of LMP either 
with or without an immune adjuvant. 

In this process, preimmune serum is collected prior to the first 
immunization. Each animal receives between about 0.1 mg and about 1000 
mg of LMP associated with an acceptable immune adjuvant, if desired. Such
acceptable adjuvants include, but are not limited to, Freund's complete, 
Freund's incomplete, alum-precipitate, water in oil emulsion containing 
Corynebacterium parvum and tRNA adjuvants. The initial immunization 
consists of LMP in, preferably, Freund's complete adjuvant injected at multiple 
sites either subcutaneously (SC), intraperitoneally (IP) or both. Each animal is
bled at regular intervals, preferably weekly, to determine antibody titer. The 
animals may or may not receive booster injections following the initial 
immunization. Those animals receiving booster injections are generally given 
an equal amount of the antigen in Freund's incomplete adjuvant by the same 
route. Booster injections are given at about three week intervals until maximal
titers are obtained. At about 7 days after each booster immunization or about 
weekly after a single immunization, the animals are bled, the serum collected, 
and aliquots are stored at about -20° C. 

Monoclonal antibodies (mAb) reactive with LMP are prepared by 
immunizing inbred mice, preferably Balb/c mice, with LMP. The mice are
immunized by the IP or SC route with about 0.1 mg to about 10 mg, preferably 
about 1 mg, of LMP in about 0.5 ml buffer or saline incorporated in an equal 
volume of an acceptable adjuvant, as discussed above. Freund's complete 
adjuvant is preferred. The mice receive an initial immunization on day 0 and 
are rested for about 3-30 weeks. Immunized mice are given one or more
booster immunizations of about 0.1 to about 10 mg of LMP in a buffer solution 
such as phosphate buffered saline by the intravenous (IV) route. Lymphocytes 
from antibody-positive mice, preferably splenic lymphocytes, are obtained by 
removing the spleens from immunized mice by standard procedures known in 
the art. Hybridoma cells are produced by mixing the splenic lymphocytes with
an appropriate fusion partner, preferably myeloma cells, under conditions
which will allow the formation of stable hybridomas. Fusion partners may
include, but are not limited to: mouse myelomas P3/NS1/Ag 4-1; MPC-11; S-194
and Sp 2/0, with Sp 2/0 being preferred. The antibody producing cells and
myeloma cells are fused in polyethylene glycol, about 1000 mol. wt., at
concentrations from about 30% to about 50%. Fused hybridoma cells are
selected by growth in hypoxanthine, thymidine and aminopterin in 
supplemented Dulbecco's Modified Eagles Medium (DMEM) by procedures 
known in the art. Supernatant fluids are collected from growth positive wells on 
about days 14, 18, and 21, and are screened for antibody production by an
immunoassay such as solid phase immunoradioassay (SPIRA) using LMP as
the antigen. The culture fluids are also tested in the Ouchterlony precipitation
assay to determine the isotype of the mAb. Hybridoma cells from antibody
positive wells are cloned by a technique such as the soft agar technique of
MacPherson, "Soft Agar Techniques", in Tissue Culture Methods and
Applications, Kruse and Paterson (eds.), Academic Press (1973). See, also,
Harlow et al., Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory
(1988).

Monoclonal antibodies may also be produced in vivo by injection of 
pristane-primed Balb/c mice, approximately 0.5 ml per mouse, with about
2x10^6 to about 6x10^6 hybridoma cells about 4 days after priming. Ascites fluid
is collected at approximately 8-12 days after cell transfer and the monoclonal 
antibodies are purified by techniques known in the art. 

In vitro production of anti-LMP mAb is carried out by growing the
hybridoma cell line in DMEM containing about 2% fetal calf serum to obtain
sufficient quantities of the specific mAb. The mAb are purified by techniques
known in the art. 

Antibody titers of ascites or hybridoma culture fluids are determined by 
various serological or immunological assays, which include, but are not limited 
to, precipitation, passive agglutination, enzyme-linked immunosorbent assay
(ELISA) technique and radioimmunoassay (RIA) techniques. Similar assays
are used to detect the presence of the LMP in body fluids or tissue and cell
extracts. 

It is readily apparent to those skilled in the art that the above described 
methods for producing monospecific antibodies may be utilized to produce 
antibodies specific for polypeptide fragments of LMP, full-length nascent LMP
polypeptide, or variants or alleles thereof. 

In another embodiment, the invention is directed to alternative splice 
variants of HLMP-1. PCR analysis of human heart cDNA revealed mRNA for
two HLMP alternative splice variants, named HLMP-2 and HLMP-3, that differ
from HLMP-1 in a region between base pairs 325 and 444 in the HLMP-1
sequence. The HLMP-2 sequence has a 119 base pair deletion and an
insertion of 17 base pairs in this region. These changes preserve the reading
frame, resulting in a 423 amino acid protein, which compared to HLMP-1, has a
net loss of 34 amino acids (40 amino acids deleted plus 6 inserted amino
acids). HLMP-2 contains the C-terminal LIM domains that are present in
HLMP-1. 

Compared to HLMP-1, HLMP-3 has no deletions, but it does have the 
same 17 base pair insertion at position 444. This insertion shifts the reading 
frame, causing a stop codon at base pairs 459-461. As a result, HLMP-3 
encodes a protein of 153 amino acids. This protein lacks the C-terminal LIM
domains that are present in HLMP-1 and HLMP-2. The predicted size of the 
proteins encoded by HLMP-2 and HLMP-3 was confirmed by western blot 
analysis. 
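
The length bookkeeping for the two splice variants can be checked with a short calculation. The Python sketch below is illustrative only: the function and variable names are assumptions introduced here, and the inputs are limited to figures stated above (a 119 base pair deletion, a 17 base pair insertion, and a 423 amino acid HLMP-2 protein with a net loss of 34 amino acids).

```python
CODON = 3  # nucleotides per codon


def net_codon_change(deleted_bp, inserted_bp):
    """Return (frame_preserved, net_change_in_codons) for a deletion/insertion.

    The codon change is None when the net length change is not a multiple
    of three, because the reading frame is then shifted.
    """
    net_bp = inserted_bp - deleted_bp
    frame_preserved = net_bp % CODON == 0
    return frame_preserved, (net_bp // CODON if frame_preserved else None)


# HLMP-2: 119 bp deleted and 17 bp inserted (values from the text).
hlmp2_frame_preserved, hlmp2_delta = net_codon_change(deleted_bp=119, inserted_bp=17)

# Full-length protein size implied by the text: 423 aa after a net loss of 34.
hlmp1_length = 423 + 34
hlmp2_length = hlmp1_length + hlmp2_delta

# HLMP-3: the same 17 bp insertion without any deletion.
hlmp3_frame_preserved, _ = net_codon_change(deleted_bp=0, inserted_bp=17)

print(hlmp2_frame_preserved, hlmp2_delta, hlmp2_length, hlmp3_frame_preserved)
```

Under these figures the net change for HLMP-2 is 102 base pairs, a multiple of three, so the reading frame is retained. A 17 base pair insertion on its own is not a multiple of three, which is consistent with the frame shift described for HLMP-3.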

PCR analysis of the tissue distribution of the three splice variants 
revealed that they are differentially expressed, with specific isoforms
predominating in different tissues. HLMP-1 is apparently the predominant form 
expressed in leukocytes, spleen, lung, placenta, and fetal liver. HLMP-2 
appears to be the predominant isoform in skeletal muscle, bone marrow, and 
heart tissue. HLMP-3, however, was not the predominant isoform in any tissue 
examined.

Overexpression of HLMP-3 in secondary rat osteoblast cultures
induced bone nodule formation (287±56) similar to the effect seen for
glucocorticoid (272±7) and HLMP-1 (232±200). Since HLMP-3 lacks the
C-terminal LIM domains, these regions are not required for osteoinductive activity.
Overexpression of HLMP-2, however, did not induce nodule formation (11±3).
These data suggest that the amino acids encoded by the deleted 119 base 
pairs are necessary for osteoinduction. The data also suggest that the 
distribution of HLMP splice variants may be important for tissue-specific 
function. Surprisingly, we have shown that HLMP-2 inhibits steroid-induced
osteoblast formation in secondary rat osteoblast cultures. Therefore, HLMP-2
will have therapeutic utility in clinical situations where bone formation is not
desirable.

On July 22, 1997, a sample of 10-4/RLMP in a vector designated 
pCMV2/RLMP (which is vector pRc/CMV2 with insert 10-4 clone/RLMP) was
deposited with the American Type Culture Collection (ATCC), 12301 Parklawn
Drive, Rockville, MD 20852. The culture accession number for that deposit is
209153. On March 19, 1998, a sample of the vector pHis-A with insert HLMP-1s
was deposited at the American Type Culture Collection ("ATCC"). The
culture accession number for that deposit is 209698. On April 14, 2000,
samples of plasmids pHAhLMP-2 (vector pHisA with cDNA insert derived from
human heart muscle cDNA with HLMP-2) and pHAhLMP-3 (vector pHisA with
cDNA insert derived from human heart muscle cDNA with HLMP-3) were
deposited with the ATCC, 10801 University Blvd., Manassas, VA, 20110-2209,
USA, under the conditions of the Budapest Treaty. The accession numbers for
these deposits are ______ and ______, respectively. These deposits, as
required by the Budapest Treaty, will be maintained in the ATCC for at least 30
years and will be made available to the public upon the grant of a patent
disclosing them. It should be understood that the availability of a deposit does
not constitute a license to practice the subject invention in derogation of patent
rights granted by government action.

In assessing the nucleic acids, proteins, or antibodies of the invention,
enzyme assays, protein purification, and other conventional biochemical 
methods are employed. DNA and RNA are analyzed by Southern blotting and 
Northern blotting techniques, respectively. Typically, the samples analyzed are 
size fractionated by gel electrophoresis. The DNA or RNA in the gels is then
transferred to nitrocellulose or nylon membranes. The blots, which are replicas
of sample patterns in the gels, are then hybridized with probes. Typically, the
probes are radiolabeled, preferably with 32P, although one could label the
probes with other signal-generating molecules known to those in the art.
Specific bands of interest can then be visualized by detection systems, such as
autoradiography. 

For purposes of illustrating preferred embodiments of the present 
invention, the following, non-limiting examples are included. These results 
demonstrate the feasibility of inducing or enhancing the formation of bone
using the LIM mineralization proteins of the invention, and the isolated nucleic
acid molecules encoding those proteins. 

EXAMPLE 1: Calvarial Cell Culture

Rat calvarial cells, also known as rat osteoblasts ("ROB"), were obtained
from 20-day pre-parturition rats as previously described. Boden et al.,
Endocrinology, 137(8):3401-07 (1996). Primary cultures were grown to
confluence (7 days), trypsinized, and passed into 6-well plates (1 x 10^5 cells/35
mm well) as first subculture cells. The subculture cells, which were confluent
at day 0, were grown for an additional 7 days. Beginning on day 0, media were
changed and treatments (Trm and/or BMPs) were applied, under a laminar flow
hood, every 3 or 4 days. The standard culture protocol was as follows: days 1-7,
MEM, 10% FBS, 50 µg/ml ascorbic acid, ± stimulus; days 8-14, BGJb
medium, 10% FBS, 5 mM β-GlyP (as a source of inorganic phosphate to permit
mineralization). Endpoint analysis of bone nodule formation and osteocalcin
secretion was performed at day 14. The dose of BMP was chosen as 50 ng/ml
based on pilot experiments in this system that demonstrated a mid-range effect
on the dose-response curve for all BMPs studied. 



EXAMPLE 2: Antisense Treatment and Cell Culture 

To explore the potential functional role of LMP-1 during membranous 
bone formation, we synthesized an antisense oligonucleotide to block LMP-1 
mRNA translation and treated secondary osteoblast cultures that were 
undergoing differentiation initiated by glucocorticoid. Inhibition of RLMP
expression was accomplished with a highly specific antisense oligonucleotide
(having no significant homologies to known rat sequences) corresponding to a
25 bp sequence spanning the putative translational start site (SEQ ID NO: 42).
Control cultures either did not receive oligonucleotide or they received sense
oligonucleotide. Experiments were performed in the presence (preincubation)
and absence of lipofectamine. Briefly, 22 µg of sense or antisense RLMP
oligonucleotide was incubated in MEM for 45 minutes at room temperature.
Following that incubation, either more MEM or pre-incubated
lipofectamine/MEM (7% v/v; incubated 45 minutes at room temperature) was
added to achieve an oligonucleotide concentration of 0.2 µM. The resulting
mixture was incubated for 15 minutes at room temperature. Oligonucleotide
mixtures were then mixed with the appropriate medium, that is,
MEM/Ascorbate/±Trm, to achieve a final oligonucleotide concentration of 0.1
µM.
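
The amounts given above can be related by a short calculation. The Python sketch below is an illustration under stated assumptions and is not part of the described protocol: it uses an average mass of about 330 g/mol per nucleotide (a common rule of thumb that is not stated in the text) to estimate how many nanomoles 22 µg of a 25-mer represents and the volume in which that amount gives 0.2 µM. The function names are hypothetical.

```python
AVG_NT_MASS = 330.0  # g/mol per nucleotide; rule-of-thumb value, assumed here


def oligo_nmol(mass_ug, length_nt, nt_mass=AVG_NT_MASS):
    """Approximate amount (nmol) of a single-stranded oligonucleotide."""
    micromoles = mass_ug / (length_nt * nt_mass)  # ug / (g/mol) = umol
    return micromoles * 1000.0                    # umol -> nmol


def volume_ml_for(nmol, conc_um):
    """Volume (ml) in which `nmol` gives `conc_um` micromolar (1 uM = 1 nmol/ml)."""
    return nmol / conc_um


amount_nmol = oligo_nmol(mass_ug=22.0, length_nt=25)
stock_volume_ml = volume_ml_for(amount_nmol, conc_um=0.2)

# Going from 0.2 uM to a final 0.1 uM corresponds to a two-fold dilution.
dilution_factor = 0.2 / 0.1

print(f"{amount_nmol:.2f} nmol, {stock_volume_ml:.1f} ml at 0.2 uM, "
      f"{dilution_factor:.0f}-fold dilution to 0.1 uM")
```

With the assumed nucleotide mass, the two-fold dilution step is the only figure that follows directly from the text; the nanomole and volume estimates change if a different average mass is used.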

Cells were incubated with the appropriate medium (±stimulus) in the
presence or absence of the appropriate oligonucleotides. Cultures originally
incubated with lipofectamine were re-fed after 4 hours of incubation (37°C; 5%
CO2) with media containing neither lipofectamine nor oligonucleotide. All
cultures, especially cultures receiving oligonucleotide, were re-fed every 24
hours to maintain oligonucleotide levels.

LMP-1 antisense oligonucleotide inhibited mineralized nodule formation
and osteocalcin secretion in a dose-dependent manner, similar to the effect of
BMP-6 oligonucleotide. The LMP-1 antisense block in osteoblast
differentiation could not be rescued by addition of exogenous BMP-6, while the
BMP-6 antisense oligonucleotide inhibition was reversed with addition of
BMP-6. This experiment further confirmed the upstream position of LMP-1 relative
to BMP-6 in the osteoblast differentiation pathway. LMP-1 antisense
oligonucleotide also inhibited spontaneous osteoblast differentiation in primary
rat osteoblast cultures. 



EXAMPLE 3: Quantitation of Mineralized Bone Nodule Formation 

Cultures of ROBs prepared according to Examples 1 and 2 were fixed 
overnight in 70% ethanol and stained with von Kossa silver stain. A
semi-automated computerized video image analysis system was used to quantitate
nodule count and nodule area in each well. Boden et al., Endocrinology,
137(8):3401-07 (1996). These values were then divided to calculate the area
per nodule values. This automated process was validated against a manual
counting technique and demonstrated a correlation coefficient of 0.92
(p < 0.000001). All data are expressed as the mean ± standard error of the
mean (S.E.M.) calculated from 5 or 6 wells at each condition. Each experiment
was confirmed at least twice using cells from different calvarial preparations.
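
The per-well calculations described above can be sketched as follows. This is a minimal Python illustration, not the image analysis system itself: the well values are hypothetical, the function names are introduced here, and the S.E.M. is assumed to be the sample standard deviation divided by the square root of the number of wells.

```python
import math
import statistics


def area_per_nodule(total_area, nodule_count):
    """Nodule area divided by nodule count for one well; 0.0 if no nodules."""
    return total_area / nodule_count if nodule_count else 0.0


def mean_sem(values):
    """Return (mean, standard error of the mean) for a list of well values."""
    mean = statistics.mean(values)
    sem = statistics.stdev(values) / math.sqrt(len(values))  # sample SD / sqrt(n)
    return mean, sem


# Hypothetical (nodule count, total nodule area) pairs for five wells.
wells = [(250, 50.0), (300, 66.0), (280, 56.0), (310, 62.0), (260, 52.0)]

counts = [count for count, _ in wells]
areas_per_nodule = [area_per_nodule(area, count) for count, area in wells]

count_mean, count_sem = mean_sem(counts)
apn_mean, apn_sem = mean_sem(areas_per_nodule)
print(f"nodule count: {count_mean:.1f} ± {count_sem:.1f}")
print(f"area per nodule: {apn_mean:.3f} ± {apn_sem:.3f}")
```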

EXAMPLE 4: Quantitation of Osteocalcin Secretion 



Osteocalcin levels in the culture media were measured using a 
competitive radioimmunoassay with a monospecific polyclonal antibody (Pab)
raised in our laboratory against the C-terminal nonapeptide of rat osteocalcin
as described in Nanes et al., Endocrinology, 127:588 (1990). Briefly, 1 µg of
nonapeptide was iodinated with 1 mCi 125I-Na by the lactoperoxidase method.
Tubes containing 200 µl of assay buffer (0.02 M sodium phosphate, 1 mM
EDTA, 0.001% thimerosal, 0.025% BSA) received media taken from cell
cultures or osteocalcin standards (0 - 12,000 fmole) at 100 µl/tube in assay
buffer. The Pab (1:40,000; 100 µl) was then added, followed by the iodinated
peptide (12,000 cpm; 100 µl). Samples tested for non-specific binding were
prepared similarly but contained no antibody. 

Bound and free Pabs were separated by the addition of 700 µl goat
anti-rabbit IgG, followed by incubation for 18 hours at 4°C. After samples were
centrifuged at 1200 rpm for 45 minutes, the supernatants were decanted and
the precipitates counted in a gamma counter. Osteocalcin values were
reported in fmole/100 µl, which was then converted to pmole/ml medium (3-day
production) by dividing those values by 100. Values were expressed as the
mean ± S.E.M. of triplicate determinations for 5-6 wells for each condition.
Each experiment was confirmed at least two times using cells from different 
calvarial preparations. 
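
The unit conversion described above (dividing by 100) follows from two steps: scaling 100 µl up to 1 ml and converting fmole to pmole. The Python sketch below spells those steps out; the function name is hypothetical and introduced only for illustration.

```python
def fmol_per_100ul_to_pmol_per_ml(value):
    """Convert an osteocalcin reading in fmole/100 ul to pmole/ml."""
    fmol_per_ml = value * (1000.0 / 100.0)  # 1 ml = 1000 ul, so multiply by 10
    return fmol_per_ml / 1000.0             # 1 pmole = 1000 fmole


# The two steps combine to a single division by 100, as stated in the text.
reading = 12000.0  # top of the standard range, in fmole/100 ul
print(fmol_per_100ul_to_pmol_per_ml(reading), reading / 100.0)
```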

EXAMPLE 5: Effect of Trm and RLMP on Mineralization In Vitro 

There was little apparent effect of either the sense or antisense 
oligonucleotides on the overall production of bone nodules in the non- 
stimulated cell culture system. When ROBs were stimulated with Trm, 
however, the antisense oligonucleotide to RLMP inhibited mineralization of 
nodules by > 95%. The addition of exogenous BMP-6 to the oligonucleotide-treated
cultures did not rescue the mineralization of RLMP-antisense-treated
nodules. 

Osteocalcin has long been synonymous with bone mineralization, and 
osteocalcin levels have been correlated with nodule production and
mineralization. The RLMP-antisense oligonucleotide significantly decreases
osteocalcin production, but the nodule count in antisense-treated cultures does
not change significantly. In this case, the addition of exogenous BMP-6 only
rescued the production of osteocalcin in RLMP-antisense-treated cultures by
10-15%. This suggests that the action of RLMP is downstream of, and more
specific than, BMP-6.

WO 00/66178    PCT/US00/11664

EXAMPLE 6: Harvest and Purification of RNA 

Cellular RNA from duplicate wells of ROBs (prepared according to 
Examples 1 and 2 in 6-well culture dishes) was harvested using 4M guanidine
isothiocyanate (GIT) solution to yield statistical triplicates. Briefly, culture
supernatant was aspirated from the wells, which were then overlayed with 0.6
ml of GIT solution per duplicate well harvest. After adding the GIT solution, the
plates were swirled for 5-10 seconds (being as consistent as possible).
Samples were saved at -70°C for up to 7 days before further processing.

RNA was purified by a slight modification of standard methods
according to Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd
Ed., chapter 7.19, Cold Spring Harbor Press (1989). Briefly, thawed samples
received 60 µl 2.0 M sodium acetate (pH 4.0), 550 µl phenol (water saturated)
and 150 µl chloroform:isoamyl alcohol (49:1). After vortexing, the samples were
centrifuged (10000 x g; 20 minutes; 4°C), the aqueous phase transferred to a
fresh tube, 600 µl isopropanol was added and the RNA precipitated overnight at
-20°C. 

Following the overnight incubation, the samples were centrifuged 
(10000 x g; 20 minutes) and the supernatant was aspirated gently. The pellets
were resuspended in 400 µl DEPC-treated water, extracted once with
phenol:chloroform (1:1), extracted with chloroform:isoamyl alcohol (24:1) and
precipitated overnight at -20°C after addition of 40 µl sodium acetate (3.0 M; pH
5.2) and 1.0 ml absolute ethanol. To recover the cellular RNA, the samples
were centrifuged (10000 x g; 20 min), washed once with 70% ethanol, air dried
for 5-10 minutes and resuspended in 20 µl of DEPC-treated water. RNA
concentrations were calculated from optical densities that were determined
with a spectrophotometer.

EXAMPLE 7: Reverse Transcription-Polymerase Chain Reaction

Heated total RNA (5 µg in 10.5 µl total volume DEPC-H2O at 65°C for 5
minutes) was added to tubes containing 4 µl 5X MMLV-RT buffer, 2 µl dNTPs, 2 µl
dT17 primer (10 pmol/ml), 0.5 µl RNAsin (40 U/ml) and 1 µl MMLV-RT (200
units/µl). The samples were incubated at 37°C for 1 hour, then at 95°C for 5
minutes to inactivate the MMLV-RT. The samples were diluted by addition of
80 µl of water.
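
The volumes listed above can be tallied to see how much of the input RNA is represented in a downstream aliquot. The Python sketch below is illustrative only: it assumes the component volumes are in microliters, and the 5 µl aliquot size used in the last step is taken from the PCR step of this example. Variable names are introduced here.

```python
# Volumes (in microliters) of the reverse-transcription components listed above.
rt_components_ul = {
    "RNA in DEPC-H2O": 10.5,
    "5X MMLV-RT buffer": 4.0,
    "dNTPs": 2.0,
    "dT17 primer": 2.0,
    "RNAsin": 0.5,
    "MMLV-RT": 1.0,
}

rt_volume_ul = sum(rt_components_ul.values())  # total reaction volume
diluted_volume_ul = rt_volume_ul + 80.0        # after adding 80 ul of water

# A 5X buffer added at 4 ul into the total volume ends up at this strength.
buffer_final_x = 5.0 * rt_components_ul["5X MMLV-RT buffer"] / rt_volume_ul

# RNA equivalent carried into PCR by a 5 ul aliquot of the diluted reaction.
rna_input_ug = 5.0
pcr_aliquot_ul = 5.0
rna_equivalent_ug = rna_input_ug * pcr_aliquot_ul / diluted_volume_ul

print(rt_volume_ul, diluted_volume_ul, buffer_final_x, rna_equivalent_ug)
```

On these figures the reaction volume is 20 µl, the buffer is at 1X, and each 5 µl aliquot of the 100 µl diluted sample corresponds to the cDNA from 0.25 µg of input RNA.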

Reverse-transcribed samples (5 µl) were subjected to polymerase-chain
reaction using standard methodologies (50 µl total volume). Briefly, samples
were added to tubes containing water and appropriate amounts of PCR buffer,
25 mM MgCl2, dNTPs, forward and reverse primers for glyceraldehyde
3-phosphate dehydrogenase (GAP, a housekeeping gene) and/or BMP-6,
32P-dCTP, and Taq polymerase. Unless otherwise noted, primers were
standardized to run consistently at 22 cycles (94°C, 30"; 58°C, 30"; 72°C, 20").

EXAMPLE 8: Quantitation of RT-PCR Products by Polyacrylamide Gel
Electrophoresis (PAGE) and PhosphorImager Analysis

RT-PCR products received 5 µl/tube loading dye, were mixed, heated at
65°C for 10 min and centrifuged. Ten µl of each reaction was subjected to
PAGE (12% polyacrylamide:bis; 15 V/well; constant current) under standard
conditions. Gels were then incubated in gel preserving buffer (10% v/v
glycerol, 7% v/v acetic acid, 40% v/v methanol, 43% deionized water) for 30
minutes, dried (80°C) in vacuo for 1-2 hours and developed with an
electronically-enhanced phosphorescence imaging system for
6-24 hours. Visualized bands were analyzed. Counts per band were plotted
graphically.

EXAMPLE 9: Differential Display PCR

RNA was extracted from cells stimulated with glucocorticoid (Trm, 1 
nM). Heated, DNase-treated total RNA (5 µg in 10.5 µl total volume in
DEPC-H2O at 65°C for 5 minutes) was reverse transcribed as described in Example 7,
but H-T11M (SEQ ID NO: 4) was used as the MMLV-RT primer. The resulting
cDNAs were PCR-amplified as described above, but with various commercial
primer sets (for example, H-T11G (SEQ ID NO: 4) and H-AP-10 (SEQ ID NO: 5);
GenHunter Corp, Nashville, TN). Radiolabeled PCR products were fractionated by gel
electrophoresis on a DNA sequencing gel. After electrophoresis, the resulting
gels were dried in vacuo and autoradiographs were exposed overnight. Bands
representing differentially-expressed cDNAs were excised from the gel and
reamplified by PCR using the method of Conner et al., Proc. Natl. Acad. Sci.
USA, 88:278 (1983). The products of PCR reamplification were cloned into
the vector PCR-II (TA cloning kit; InVitrogen, Carlsbad, CA).

EXAMPLE 10: Screening of a UMR 106 Rat Osteosarcoma Cell cDNA Library 

A UMR 106 library (2.5 x 10^10 pfu/ml) was plated at 5 x 10^4 pfu/ml onto
agar plates (LB bottom agar) and the plates were incubated overnight at 37°C.
Filter membranes were overlaid onto plates for two minutes. Once removed,
the filters were denatured, rinsed, dried and UV cross-linked. The filters were
then incubated in pre-hybridization buffer (2X PIPES [pH 6.5], 5% formamide,
1% SDS and 100 µg/ml denatured salmon sperm DNA) for 2 h at 42°C. A 260
base-pair radiolabeled probe (SEQ ID NO: 3; 32P labelled by random priming)
was added to the entire hybridization mix/filters, followed by hybridization for 18
hours at 42°C. The membranes were washed once at room temperature (10
min, 1 x SSC, 0.1% SDS) and three times at 55°C (15 min, 0.1 x SSC, 0.1%
SDS).

After they were washed, the membranes were analyzed by 
autoradiography as described above. Positive clones were plaque purified. 
The procedure was repeated with a second filter for four minutes to minimize 
spurious positives. Plaque-purified clones were rescued as lambda SK(-) 
phagemids. Cloned cDNAs were sequenced as described below.

EXAMPLE 11: Sequencing of Clones 

Cloned cDNA inserts were sequenced by standard methods. Ausubel et
al., Current Protocols in Molecular Biology, Wiley Interscience (1988). Briefly,
appropriate concentrations of termination mixture, template and reaction
mixture were subjected to an appropriate cycling protocol (95°C, 30s; 68°C, 30s;
72°C, 60s; x 25). Stop mixture was added to terminate the sequencing
reactions. After heating at 92°C for 3 minutes, the samples were loaded onto a 
denaturing 6% polyacrylamide sequencing gel (29:1 acrylamide:bis- 
acrylamide). Samples were electrophoresed for about 4 hours at 60 volts, 
constant current. After electrophoresis, the gels were dried in vacuo and 
autoradiographed. 

The autoradiographs were analyzed manually. The resulting sequences 
were screened against the databases maintained by the National Center for 
Biotechnology Information (NIH, Bethesda, MD; http://www.ncbi.nlm.nih.gov/) 
using the BLASTn program set with default parameters. Based on the 
sequence data, new sequencing primers were prepared and the process was 
repeated until the entire gene had been sequenced. All sequences were 
confirmed a minimum of three times in both orientations. 

Nucleotide and amino acid sequences were also analyzed using the 
PCGENE software package (version 16.0). Per cent homology values for 
nucleotide sequences were calculated by the program NALIGN, using the 
following parameters: weight of non-matching nucleotides, 10; weight of non- 
matching gaps, 10; maximum number of nucleotides considered, 50; and 
minimum number of nucleotides considered, 50.

For amino acid sequences, per cent homology values were calculated
using PALIGN. A value of 10 was selected for both the open gap cost and the
unit gap cost.

EXAMPLE 12: Cloning of RLMP cDNA



The differential display PCR amplification products described in 
Example 9 contained a major band of approximately 260 base pairs. This 
sequence was used to screen a rat osteosarcoma (UMR 106) cDNA library.
Positive clones were subjected to nested primer analysis to obtain the primer
sequences necessary for amplifying the full length cDNA (SEQ ID NOs: 11,
12, 29, 30 and 31). One of those positive clones selected for further study was
designated clone 10-4.

Sequence analysis of the full-length cDNA in clone 10-4, determined by
nested primer analysis, showed that clone 10-4 contained the original 260
base-pair fragment identified by differential display PCR. Clone 10-4 (1696
base pairs; SEQ ID NO: 2) contains an open reading frame of 1371 base pairs
encoding a protein having 457 amino acids (SEQ ID NO: 1). The termination
codon, TGA, occurs at nucleotides 1444-1446. The polyadenylation signal at
nucleotides 1675-1680, and adjacent poly(A)+ tail, was present in the 3'
noncoding region. There were two potential N-glycosylation sites, Asn-Lys-Thr
and Asn-Arg-Thr, at amino acid positions 113-116 and 257-259 in SEQ ID NO:
1, respectively. Two potential cAMP- and cGMP-dependent protein kinase
phosphorylation sites, Ser and Thr, were found at amino acid positions 191
and 349, respectively. There were five potential protein kinase C
phosphorylation sites, Ser or Thr, at amino acid positions 3, 115, 166, 219,
442. One potential ATP/GTP binding site motif A (P-loop), Gly-Gly-Ser-Asn- 
Asn-Gly-Lys-Thr, was determined at amino acid positions 272-279. 

In addition, two highly conserved putative LIM domains were found at
amino acid positions 341-391 and 400-451. The putative LIM domains in this
newly identified rat cDNA clone showed considerable homology with the LIM
domains of other known LIM proteins. However, the overall homology with
other rat LIM proteins was less than 25%. RLMP (also designated 10-4) has
78.5% amino acid homology to the human enigma protein (see U.S. Patent No.
5,504,192), but only 24.5% and 22.7% amino acid homology to its closest rat
homologs, CLP-36 and RIT-18, respectively.
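
The coordinates reported for clone 10-4 can be cross-checked arithmetically. The Python sketch below is illustrative only: it assumes that the 1371 base pair open reading frame excludes the termination codon and ends immediately before it, so the implied start position is a derived value and not one stated in the text. Variable and function names are introduced here.

```python
CODON = 3  # nucleotides per codon

orf_bp = 1371              # open reading frame length (from the text)
stop_codon = (1444, 1446)  # TGA position (from the text)

# 1371 bp of coding sequence corresponds to a whole number of codons.
orf_is_whole_codons = orf_bp % CODON == 0
protein_length = orf_bp // CODON

# Assumption: the ORF ends at the nucleotide just before the stop codon.
cds_end = stop_codon[0] - 1
cds_start = cds_end - orf_bp + 1


def span_length(first, last):
    """Number of residues in an inclusive range of positions."""
    return last - first + 1


lim_domains = [(341, 391), (400, 451)]  # putative LIM domains (from the text)
lim_lengths = [span_length(first, last) for first, last in lim_domains]

print(protein_length, cds_start, cds_end, lim_lengths)
```

Under that assumption the coding region would span nucleotides 73 to 1443, giving the 457 amino acids stated above, and the two putative LIM domains would be 51 and 52 residues long.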

EXAMPLE 13: Northern Blot Analysis of RLMP Expression 

Thirty µg of total RNA from ROBs, prepared according to Examples 1 and
2, was size fractionated by formaldehyde gel electrophoresis in 1% agarose
flatbed gels and osmotically transblotted to nylon membranes. The blot was
probed with a 600 base pair EcoRI fragment of full-length 10-4 cDNA labeled
with 32P-dCTP by random priming.

Northern blot analysis showed a 1.7 kb mRNA species that hybridized 
with the RLMP probe. RLMP mRNA was up-regulated approximately 3.7-fold
in ROBs after 24 hours exposure to BMP-6. No up-regulation of RLMP
expression was seen in BMP-2 or BMP-4-stimulated ROBs at 24 hours. 

EXAMPLE 14: Statistical Methods 

For each reported nodule/osteocalcin result, data from 5-6 wells from a 
representative experiment were used to calculate the mean ± S.E.M. Graphs 
may be shown with data normalized to the maximum value for each parameter 
to allow simultaneous graphing of nodule counts, mineralized areas and
osteocalcin.
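
Normalizing each parameter to its own maximum, as described above, puts parameters with different units on a common 0 to 1 scale. The Python sketch below illustrates this; the data values are hypothetical and the function name is introduced here.

```python
def normalize_to_max(values):
    """Scale a series so that its largest value becomes 1.0."""
    peak = max(values)
    if peak == 0:
        # Avoid division by zero when every value is zero.
        return [0.0 for _ in values]
    return [value / peak for value in values]


# Hypothetical condition means for two parameters with different units.
nodule_counts = [12.0, 150.0, 300.0]
osteocalcin_pmol_per_ml = [0.5, 2.0, 4.0]

print(normalize_to_max(nodule_counts))
print(normalize_to_max(osteocalcin_pmol_per_ml))
```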

For each reported RT-PCR, RNase protection assay or Western blot 
analysis, data from triplicate samples of representative experiments were used
to determine the mean ± S.E.M. Graphs may be shown normalized to either
day 0 or negative controls and expressed as fold-increase above control
values.

Statistical significance was evaluated using a one-way analysis of
variance with post-hoc multiple comparison corrections of Bonferroni as 
appropriate. D. V. Huntsberger, "The Analysis of Variance," in Elements of
Statistical Variance, P. Billingsley (ed.), pp. 298-330, Allyn & Bacon Inc.,
Boston, MA (1977) and Sigmastat, Jandel Scientific, Corte Madera, CA. Alpha
levels for significance were defined as p < 0.05. 
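
The Bonferroni step mentioned above can be sketched in a few lines. This Python example covers only the multiple-comparison correction, not the analysis of variance itself, and it is not the Sigmastat procedure: the p-values are hypothetical, the function name is introduced here, and the correction is assumed to be the usual one of multiplying each pairwise p-value by the number of comparisons.

```python
ALPHA = 0.05  # significance threshold stated in the text


def bonferroni(p_values, alpha=ALPHA):
    """Return (adjusted p-value, significant?) for each pairwise comparison."""
    n_comparisons = len(p_values)
    results = []
    for p in p_values:
        adjusted = min(p * n_comparisons, 1.0)  # cap adjusted p-values at 1.0
        results.append((adjusted, adjusted < alpha))
    return results


# Hypothetical unadjusted p-values for three post-hoc pairwise comparisons.
pairwise_p = [0.01, 0.02, 0.04]
for raw, (adjusted, significant) in zip(pairwise_p, bonferroni(pairwise_p)):
    print(f"raw p = {raw:.3f}, adjusted p = {adjusted:.3f}, significant: {significant}")
```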

EXAMPLE 15: Detection of Rat LIM Mineralization Protein by Western Blot 
Analysis 

Polyclonal antibodies were prepared according to the methods of 
England et al., Biochim. Biophys. Acta, 623:171 (1980) and Timmer et al., J.
Biol. Chem., 268:24863 (1993).

HeLa cells were transfected with pCMV2/RLMP. Protein was harvested 
from the transfected cells according to the method of Hair et al., Leukemia
Research, 20:1 (1996). Western Blot Analysis of native RLMP was performed
as described by Towbin et al., Proc. Natl. Acad. Sci. USA, 76:4350 (1979).

EXAMPLE 16: Synthesis of the Rat LMP-Unique (RLMPU) derived Human
PCR product

Based on the sequence of the rat LMP-1 cDNA, forward and reverse 
PCR primers (SEQ ID NOs: 15 and 16) were synthesized and a unique 223 
base-pair sequence was PCR amplified from the rat LMP-1 cDNA. A similar
PCR product was isolated from human MG63 osteosarcoma cell cDNA with the
same PCR primers. 

RNA was harvested from MG63 osteosarcoma cells grown in T-75 
flasks. Culture supernatant was removed by aspiration and the flasks were 
overlayed with 3.0 ml of GIT solution per duplicate, swirled for 5-10 seconds,
and the resulting solution was transferred to 1.5 ml eppendorf tubes (5 tubes
with 0.6 ml/tube). RNA was purified by a slight modification of standard
methods, for example, see Sambrook et al., Molecular Cloning: A Laboratory
Manual, chapter 7, page 19, Cold Spring Harbor Laboratory Press (1989) and
Boden et al., Endocrinology, 138:2820-28 (1997). Briefly, the 0.6 ml samples
received 60 µl 2.0 M sodium acetate (pH 4.0), 550 µl water saturated phenol and
150 µl chloroform:isoamyl alcohol (49:1). After addition of those reagents, the
samples were vortexed, centrifuged (10000 x g; 20 min; 4°C) and the aqueous
phase transferred to a fresh tube. Isopropanol (600 µl) was added and the RNA
was precipitated overnight at -20°C. The samples were centrifuged (10000 x
g; 20 minutes) and the supernatant was aspirated gently. The pellets were
resuspended in 400 µl of DEPC-treated water, extracted once with
phenol:chloroform (1:1), extracted with chloroform:isoamyl alcohol (24:1) and
precipitated overnight at -20°C in 40 µl sodium acetate (3.0 M; pH 5.2) and 1.0
ml absolute ethanol. After precipitation, the samples were centrifuged (10000
x g; 20 min), washed once with 70% ethanol, air dried for 5-10 minutes and
resuspended in 20 µl of DEPC-treated water. RNA concentrations were derived
from optical densities. 

Total RNA (5 µg in 10.5 µl total volume in DEPC-H2O) was heated at 65°C
for 5 minutes, and then added to tubes containing 4 µl 5X MMLV-RT buffer, 2 µl
dNTPs, 2 µl dT17 primer (10 pmol/ml), 0.5 µl RNAsin (40 U/ml) and 1 µl MMLV-RT
(200 units/µl). The reactions were incubated at 37°C for 1 hour. Afterward, the
MMLV-RT was inactivated by heating at 95°C for 5 minutes. The samples
were diluted by addition of 80 µl water.
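The volumes in the reverse-transcription step can be checked with simple bookkeeping. The sketch below is illustrative only: the variable names are hypothetical, the volumes are read as microlitres, and the 5 µl PCR aliquot is the one used in the amplification step that follows. It assumes complete recovery of the 5 µg RNA input and reports "RNA-equivalent" amounts rather than measured cDNA yield.

```python
# Reverse-transcription volume bookkeeping (all volumes in microlitres).
# Names are illustrative; values are taken from the protocol text.
rt_components_ul = {
    "RNA in DEPC-water": 10.5,
    "5X MMLV-RT buffer": 4.0,
    "dNTPs": 2.0,
    "dT17 primer": 2.0,
    "RNAsin": 0.5,
    "MMLV-RT": 1.0,
}
rna_input_ug = 5.0        # total RNA added to the reaction
dilution_water_ul = 80.0  # water added after heat inactivation
pcr_template_ul = 5.0     # aliquot carried into each PCR (next step)

rt_volume_ul = sum(rt_components_ul.values())
diluted_volume_ul = rt_volume_ul + dilution_water_ul

# A 5X buffer should end up at 1X if it is one fifth of the reaction volume.
buffer_dilution_fold = rt_volume_ul / rt_components_ul["5X MMLV-RT buffer"]

# RNA-equivalent concentration after dilution, and amount per PCR aliquot.
rna_equiv_ng_per_ul = rna_input_ug * 1000 / diluted_volume_ul
rna_equiv_per_pcr_ng = rna_equiv_ng_per_ul * pcr_template_ul

print(f"RT reaction volume: {rt_volume_ul} ul")
print(f"Volume after dilution: {diluted_volume_ul} ul")
print(f"Buffer dilution: {buffer_dilution_fold}-fold")
print(f"RNA equivalent per PCR: {rna_equiv_per_pcr_ng} ng")
```

Under these assumptions the listed components sum to a 20 µl reaction, so the 5X buffer is at 1X, and each 5 µl PCR aliquot corresponds to 250 ng of input RNA.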

Transcribed samples (5 µl) were subjected to polymerase chain reaction
using standard methodologies (50 µl total volume). Boden et al., Endocrinology
138:2820-28 (1997); Ausubel et al., "Quantitation of rare DNAs by the
polymerase chain reaction", in Current Protocols in Molecular Biology, chapter
15.31-1, Wiley & Sons, Trenton, NJ (1990). Briefly, samples were added to
tubes containing water and appropriate amounts of PCR buffer, 25 mM MgCl2,
dNTPs, forward and reverse primers (for RLMPU; SEQ ID NOs: 15 and 16),
32P-dCTP, and DNA polymerase. Primers were designed to run consistently at
22 cycles for radioactive band detection and 33 cycles for amplification of PCR
product for use as a screening probe (94°C, 30 sec; 58°C, 30 sec; 72°C, 20
sec).

Sequencing of the agarose gel-purified MG63 osteosarcoma-derived
PCR product gave a sequence more than 95% homologous to the RLMPU
PCR product. That sequence is designated HLMP unique region (HLMPU;
SEQ ID NO: 6).

EXAMPLE 17: Screening of reverse-transcriptase-derived MG63 cDNA 

Screening was performed with PCR using specific primers (SEQ ID
NOs: 16 and 17) as described in Example 7. A 717 base-pair MG63 PCR
product was agarose gel purified and sequenced with the given primers (SEQ
ID NOs: 12, 15, 16, 17, 18, 27 and 28). Sequences were confirmed a
minimum of two times in both directions. The MG63 sequences were aligned
against each other and then against the full-length rat LMP cDNA sequence to
obtain a partial human LMP cDNA sequence (SEQ ID NO: 7).

EXAMPLE 18: Screening of a Human Heart cDNA Library 

Based on Northern blot experiments, it was determined that LMP-1 is
expressed at different levels by several different tissues, including human heart
muscle. A human heart cDNA library was therefore examined. The library was
plated at 5 x 10^4 pfu/ml onto agar plates (LB bottom agar) and plates were
grown overnight at 37°C. Filter membranes were overlaid onto the plates for
two minutes. Afterward, the filters were denatured, rinsed, dried, UV cross-linked
and incubated in pre-hybridization buffer (2X PIPES [pH 6.5]; 5% formamide,
1% SDS, 100 µg/ml denatured salmon sperm DNA) for 2 h at 42°C. A
radiolabeled, LMP-unique, 223 base-pair probe (32P, random primer labelling;
SEQ ID NO: 6) was added and hybridized for 18 h at 42°C. Following
hybridization, the membranes were washed once at room temperature (10 min,
1 x SSC, 0.1% SDS) and three times at 55°C (15 min, 0.1 x SSC, 0.1% SDS).
Double-positive plaque-purified heart library clones, identified by
autoradiography, were rescued as lambda phagemids according to the
manufacturer's protocols (Stratagene, La Jolla, CA).

Restriction digests of positive clones yielded cDNA inserts of varying
sizes. Inserts greater than 600 base-pairs in length were selected for initial
screening by sequencing. Those inserts were sequenced by standard
methods as described in Example 11.

One clone, number 7, was also subjected to automated sequence
analysis using primers corresponding to SEQ ID NOs: 11-14, 16 and 27. The
sequences obtained by these methods were routinely 97-100% homologous.
Clone 7 (Partial Human LMP-1 cDNA from a heart library; SEQ ID NO: 8)
contained sequence that was more than 87% homologous to the rat LMP
cDNA sequence in the translated region.

EXAMPLE 19: Determination of Full-Length Human LMP-1 cDNA 

Overlapping regions of the MG63 human osteosarcoma cell cDNA
sequence and the human heart cDNA clone 7 sequence were used to align
those two sequences and derive a complete human cDNA sequence of 1644
base-pairs. NALIGN, a program in the PCGENE software package, was used
to align the two sequences. The overlapping regions of the two sequences
constituted approximately 360 base-pairs having complete homology except for
a single nucleotide substitution at nucleotide 672 in the MG63 cDNA (SEQ ID
NO: 7), with clone 7 having an "A" instead of a "G" at the corresponding
nucleotide 516 (SEQ ID NO: 8).

The two aligned sequences were joined using SEQIN, another
subprogram of PCGENE, using the "G" substitution of the MG63 osteosarcoma
cDNA clone. The resulting sequence is shown in SEQ ID NO: 9. Alignment of
the novel human-derived sequence with the rat LMP-1 cDNA was
accomplished with NALIGN. The full-length human LMP-1 cDNA sequence
(SEQ ID NO: 9) is 87.3% homologous to the translated portion of rat LMP-1
cDNA sequence.
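The joining step in this example, in which two partial sequences are merged through a shared overlap that tolerates a single substitution, can be illustrated with a small sketch. This is not the NALIGN or SEQIN algorithm, whose internals are not described here. The function name, the toy sequences and the thresholds are hypothetical, and the choice to keep the base from the first (left) sequence in the overlap simply mirrors the decision above to retain the MG63 "G".

```python
def join_overlapping(left, right, min_overlap=8, max_mismatches=1):
    """Join two sequences through the longest suffix/prefix overlap.

    The overlap may contain up to `max_mismatches` substitutions; the
    bases of `left` are kept wherever the two sequences disagree.
    Returns (joined_sequence, overlap_length, mismatch_count).
    """
    longest = min(len(left), len(right))
    for k in range(longest, min_overlap - 1, -1):
        tail, head = left[-k:], right[:k]
        mismatches = sum(a != b for a, b in zip(tail, head))
        if mismatches <= max_mismatches:
            return left + right[k:], k, mismatches
    raise ValueError("no acceptable overlap found")


# Toy sequences (not taken from any SEQ ID): an 11-base overlap that
# differs at one position (G in `left`, A in `right`).
left = "ACGTTGCAGGCTAGCT"
right = "GCAGACTAGCTTTACG"

joined, overlap, mismatches = join_overlapping(left, right)
print(joined, overlap, mismatches)
```

With the toy input, the overlap is 11 bases with one mismatch, and the joined sequence keeps the "G" from the left-hand sequence.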

EXAMPLE 20: Determination of Amino Acid Sequence of Human LMP-1 

The putative amino acid sequence of human LMP-1 was determined
with the PCGENE subprogram TRANSL. The open reading frame in SEQ ID
NO: 9 encodes a protein comprising 457 amino acids (SEQ ID NO: 10). Using
the PCGENE subprogram Palign, the human LMP-1 amino acid sequence was
found to be 94.1% homologous to the rat LMP-1 amino acid sequence.

EXAMPLE 21: Determination of the 5 Prime Untranslated Region of the
Human LMP-1 cDNA

MG63 5' cDNA was amplified by nested RT-PCR of MG63 total RNA
using a 5' rapid amplification of cDNA ends (5' RACE) protocol. This method
included first-strand cDNA synthesis using a lock-docking oligo (dT) primer with
two degenerate nucleotide positions at the 3' end (Chenchik et al.,
CLONTECHniques X:5 (1995); Borson et al., PCR Methods Applic. 2:144
(1993)). Second-strand synthesis was performed according to the method of
Gubler et al., Gene 25:263 (1983), with a cocktail of Escherichia coli DNA
polymerase I, RNase H, and E. coli DNA ligase. After creation of blunt ends
with T4 DNA polymerase, double-stranded cDNA was ligated to the fragment
(5'-CTAATACGACTCACTATAGGGCTCGAGCGGCCGCCCGGGCAGGT-3')
(SEQ ID NO: 19). Prior to RACE, the adaptor-ligated cDNA was diluted to a
concentration suitable for Marathon RACE reactions (1:50). Adaptor-ligated
double-stranded cDNA was then ready to be specifically cloned.

First-round PCR was performed with the adaptor-specific
oligonucleotide, 5'-CCATCCTAATACGACTCACTATAGGGC-3' (AP1) (SEQ ID
NO: 20) as sense primer and a Gene Specific Primer (GSP) from the unique
region described in Example 16 (HLMPU). The second round of PCR was
performed using nested primers GSP1-HLMPU (antisense/reverse primer)
(SEQ ID NO: 23) and GSP2-HLMPUF (SEQ ID NO: 24) (see Example 16;
sense/forward primer). PCR was performed using a commercial kit (Advantage
cDNA PCR core kit; Clontech Laboratories Inc., Palo Alto, CA) that utilizes
an antibody-mediated, but otherwise standard, hot-start protocol. PCR
conditions for MG63 cDNA included an initial hot-start denaturation (94°C, 60
sec) followed by 30 cycles of 94°C, 30 sec; 60°C, 30 sec; 68°C, 4 min. The
first-round PCR product was approximately 750 base-pairs in length whereas the
nested PCR product was approximately 230 base-pairs. The first-round PCR
product was cloned into linearized pCR 2.1 vector (3.9 Kb). The inserts were
sequenced in both directions using M13 Forward and Reverse primers (SEQ
ID NO: 11; SEQ ID NO: 12).

EXAMPLE 22: Determination of Full-length Human LMP-1 cDNA with 5 Prime
UTR



Overlapping MG63 human osteosarcoma cell cDNA 5'-UTR sequence
(SEQ ID NO: 21), MG63 717 base-pair sequence (Example 17; SEQ ID NO: 8)
and human heart cDNA clone 7 sequence (Example 18) were aligned to derive
a novel human cDNA sequence of 1704 base-pairs (SEQ ID NO: 22). The
alignment was accomplished with NALIGN (both PCGENE and Omiga 1.0;
Intelligenetics). Overlapping sequences constituted nearly the entire 717
base-pair region (Example 17) with 100% homology. Joining of the aligned
sequences was accomplished with SEQIN.




EXAMPLE 23: Construction of LIM Protein Expression Vector 



The construction of the pHIS-5'ATG LMP-1s expression vector was carried
out with the sequences described in Examples 17 and 18. The 717 base-pair
clone (Example 17; SEQ ID NO: 7) was digested with ClaI and EcoRV. A
small fragment (~250 base-pairs) was gel purified. Clone 7 (Example 18; SEQ
ID NO: 8) was digested with ClaI and XbaI and a 1400 base-pair fragment was
gel purified. The isolated 250 base-pair and 1400 base-pair restriction
fragments were ligated to form a fragment of ~1650 base-pairs.

Due to the single nucleotide substitution in Clone 7 (relative to the 717
base-pair PCR sequence and the original rat sequence), a stop codon at
translated base-pair 672 resulted. Because of this stop codon, a truncated
(short) protein was encoded, hence the name LMP-1s. This was the construct
used in the expression vector (SEQ ID NO: 32). The full-length cDNA
sequence with 5' UTR (SEQ ID NO: 33) was created by alignment of SEQ ID
NO: 32 with the 5' RACE sequence (SEQ ID NO: 21). The amino acid
sequence of LMP-1s (SEQ ID NO: 34) was then deduced as a 223 amino acid
protein and confirmed by Western blot (as in Example 15) to run at the
predicted molecular weight of ~23.7 kD.
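The relationship between the stop codon position and the length of LMP-1s can be checked with a short calculation. The sketch below rests on two assumptions that are not stated in the text: that "translated base-pair 672" counts from the first base of the start codon and marks the last base of the stop codon, and that an average residue mass of about 110 Da (a common rule of thumb) is adequate for a rough mass estimate. All names are illustrative.

```python
# Position of the stop codon, counted from the first base of the ATG
# (assumed here to be the LAST base of the stop codon).
STOP_CODON_LAST_BASE = 672

# Rule-of-thumb average mass of an amino acid residue, in daltons
# (an assumption, not a value taken from the text).
AVERAGE_RESIDUE_MASS_DA = 110

# 672 / 3 = 224 complete codons; the 224th is the stop codon itself.
codon_index, remainder = divmod(STOP_CODON_LAST_BASE, 3)
protein_length = codon_index - 1

# Rough mass estimate in kilodaltons.
approx_mass_kda = protein_length * AVERAGE_RESIDUE_MASS_DA / 1000

print(f"Stop codon is codon number {codon_index} (remainder {remainder})")
print(f"Encoded protein length: {protein_length} amino acids")
print(f"Approximate mass: {approx_mass_kda:.1f} kDa")
```

Under these assumptions the stop codon is codon 224, which gives the 223 amino acid protein described above, and the rough estimate of about 24.5 kDa is in the same range as the stated ~23.7 kD.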

The pHis-ATG vector (InVitrogen, Carlsbad, CA) was digested with
EcoRV and XbaI. The vector was recovered and the 1650 base-pair restriction
fragment was then ligated into the linearized pHis-ATG. The ligated product
was cloned and amplified. The pHis-ATG-LMP-1s expression vector, also
designated pHIS-A with insert HLMP-1s, was purified by standard methods.

EXAMPLE 24: Induction of Bone Nodule Formation and Mineralization In Vitro
with LMP Expression Vector

Rat calvarial cells were isolated and grown in secondary culture
according to Example 1. Cultures were either unstimulated or stimulated with
glucocorticoid (GC) as described in Example 1. A modification of the Superfect
Reagent (Qiagen, Valencia, CA) transfection protocol was used to transfect 3
µg/well of each vector into secondary rat calvarial osteoblast cultures according
to Example 25.

Mineralized nodules were visualized by Von Kossa staining, as described in
Example 3. Human LMP-1s gene product overexpression alone induced
bone nodule formation (~203 nodules/well) in vitro. Levels of nodules were
approximately 50% of those induced by the GC positive control (~412
nodules/well). Other positive controls included the pHisA-LMP-Rat expression
vector (~152 nodules/well) and the pCMV2/LMP-Rat-Fwd expression vector
(~206 nodules/well), whereas the negative controls included the
pCMV2/LMP-Rat-Rev expression vector (~2 nodules/well) and
untreated (NT) plates (~4 nodules/well). These data demonstrate that the
human cDNA was at least as osteoinductive as the rat cDNA. The effect was
less than that observed with GC stimulation, most likely due to suboptimal
doses of expression vector.
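The "approximately 50%" comparison can be reproduced from the nodule counts reported in this example. The sketch below only restates those approximate counts and expresses each as a percentage of the GC positive control; the dictionary keys are shorthand labels, not names used by the authors.

```python
# Approximate nodule counts per well, as reported in this example.
nodules_per_well = {
    "GC positive control": 412,
    "HLMP-1s": 203,
    "pHisA-LMP-Rat": 152,
    "pCMV2/LMP-Rat-Fwd": 206,
    "pCMV2/LMP-Rat-Rev": 2,
    "untreated (NT)": 4,
}

gc_count = nodules_per_well["GC positive control"]

# Express every condition as a percentage of the GC positive control.
percent_of_gc = {
    label: 100 * count / gc_count for label, count in nodules_per_well.items()
}

for label, pct in percent_of_gc.items():
    print(f"{label}: {pct:.1f}% of GC control")
```

On these figures the human LMP-1s vector reaches roughly 49% of the GC control and the forward rat vector roughly 50%, while both negative controls remain at or below about 1%.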

EXAMPLE 25: LMP-Induced Cell Differentiation In Vitro and In Vivo

The rat LMP cDNA in clone 10-4 (see Example 12) was excised from
the vector by double-digesting the clone with NotI and ApaI overnight at 37°C.
Vector pCMV2 MCS (InVitrogen, Carlsbad, CA) was digested with the same
restriction enzymes. Both the linear cDNA fragment from clone 10-4 and
pCMV2 were gel purified, extracted and ligated with T4 ligase. The ligated
DNA was gel purified, extracted and used to transform E. coli JM109 cells for
amplification. Positive agar colonies were picked, digested with NotI and ApaI
and the restriction digests were examined by gel electrophoresis. Stock
cultures were prepared of positive clones.

A reverse vector was prepared in analogous fashion except that the
restriction enzymes used were XbaI and HindIII. Because these restriction
enzymes were used, the LMP cDNA fragment from clone 10-4 was inserted
into pRc/CMV2 in the reverse (that is, non-translatable) orientation. The
recombinant vector produced is designated pCMV2/RLMP.

An appropriate volume of pCMV10-4 (60 nM final concentration is
optimal [3 µg]; for this experiment a range of 0-600 nM/well [0-30 µg/well] final
concentration is preferred) was resuspended in Minimal Eagle Media (MEM) to
450 µl final volume and vortexed for 10 seconds. Superfect was added (7.5 µl/ml
final solution), the solution was vortexed for 10 seconds and then incubated at
room temperature for 10 minutes. Following this incubation, MEM
supplemented with 10% FBS (1 ml/well; 6 ml/plate) was added and mixed by
pipetting.

The resulting solution was then promptly pipetted (1 ml/well) onto
washed ROB cultures. The cultures were incubated for 2 hours at 37°C in a
humidified atmosphere containing 5% CO2. Afterward, the cells were gently
washed once with sterile PBS and the appropriate normal incubation medium
was added.

Results demonstrated significant bone nodule formation in all rat cell
cultures which were induced with pCMV10-4. For example, pCMV10-4
transfected cells produced 429 nodules/well. Positive control cultures, which
were exposed to Trm, produced 460 nodules/well. In contrast, negative
controls, which received no treatment, produced 1 nodule/well. Similarly, when
cultures were transfected with pCMV10-4 (reverse), no nodules were
observed.

For demonstrating de novo bone formation in vivo, marrow was
aspirated from the hindlimbs of 4-5 week old normal rats (rnu/+; heterozygous
for recessive athymic condition). The aspirated marrow cells were washed in
alpha MEM, centrifuged, and RBCs were lysed by resuspending the pellet in
0.83% NH4Cl in 10 mM Tris (pH 7.4). The remaining marrow cells were
washed 3X with MEM and transfected for 2 hours with 9 µg of pCMV-LMP-1s
(forward or reverse orientation) per 3 x 10^6 cells. The transfected cells were
then washed 2X with MEM and resuspended at a concentration of 3 x 10^7
cells/ml.

The cell suspension (100 µl) was applied via sterile pipette to a sterile 2 x
5 mm type I bovine collagen disc (Sulzer Orthopaedics, Wheat Ridge, CO).
The discs were surgically implanted subcutaneously on the skull, chest,
abdomen or dorsal spine of 4-5 week old athymic rats (rnu/rnu). The animals
were sacrificed at 3-4 weeks, at which time the discs or surgical areas were
excised and fixed in 70% ethanol. The fixed specimens were analyzed by
radiography and undecalcified histologic examination was performed on 5 µm
thick sections stained with Goldner Trichrome. Experiments were also
performed using devitalized (guanidine extracted) demineralized bone matrix
(Osteotech, Shrewsbury, NJ) in place of collagen discs.
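The cell and DNA quantities in the implant preparation can be related with a short calculation. The sketch below assumes the applied volume is 100 µl of the 3 x 10^7 cells/ml suspension and that no cells are lost during washing; the variable names are illustrative.

```python
# Quantities taken from the implant-preparation description.
CELLS_PER_ML = 30_000_000           # suspension density after washing
APPLIED_VOLUME_UL = 100             # volume pipetted onto each disc
DNA_UG_PER_TRANSFECTION = 9         # plasmid used per transfection
CELLS_PER_TRANSFECTION = 3_000_000  # cells transfected with that DNA

# Cells delivered to one collagen disc (1 ml = 1000 ul).
cells_per_disc = CELLS_PER_ML * APPLIED_VOLUME_UL // 1000

# Plasmid dose normalised to one million cells.
dna_ug_per_million_cells = DNA_UG_PER_TRANSFECTION / (
    CELLS_PER_TRANSFECTION / 1_000_000
)

print(f"Cells per disc: {cells_per_disc:,}")
print(f"DNA per million cells: {dna_ug_per_million_cells} ug")
```

On these assumptions each disc carries 3 x 10^6 cells, the output of one 9 µg transfection, which corresponds to 3 µg of plasmid per million cells.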

Radiography revealed a high level of mineralized bone formation that
conformed to the form of the original collagen disc containing LMP-1s
transfected marrow cells. No mineralized bone formation was observed in the
negative control (cells transfected with a reverse-oriented version of the
LMP-1s cDNA that did not code for a translated protein), and absorption of the
carrier appeared to be well underway.

Histology revealed new bone trabeculae lined with osteoblasts in the
LMP-1s transfected implants. No bone was seen along with partial resorption
of the carrier in the negative controls.

Radiography of a further experiment in which 18 sets (9 negative control
pCMV-LMP-REV & 9 experimental pCMV-LMP-1s) of implants were added to
sites alternating between lumbar and thoracic spine in athymic rats
demonstrated 0/9 negative control implants exhibiting bone formation (spine
fusion) between vertebrae. All nine of the pCMV-LMP-1s treated implants
exhibited solid bone fusions between vertebrae.

EXAMPLE 26: The Synthesis of pHIS-5' ATG LMP-1s Expression Vector from
the Sequences Demonstrated in Examples 2 and 3

The 717 base-pair clone (Example 17) was digested with ClaI and
EcoRV (New England Biologicals, city, MA). A small fragment (~250
base-pairs) was gel purified. Clone No. 7 (Example 18) was digested with ClaI and
XbaI. A 1400 base-pair fragment was gel purified from that digest. The
isolated 250 base-pair and 1400 base-pair cDNA fragments were ligated by
standard methods to form a fragment of ~1650 bp. The pHis-A vector
(InVitrogen) was digested with EcoRV and XbaI. The linearized vector was
recovered and ligated to the chimeric 1650 base-pair cDNA fragment. The
ligated product was cloned and amplified by standard methods, and the
pHis-A-5' ATG LMP-1s expression vector, also denominated as the vector pHis-A
with insert HLMP-1s, was deposited at the ATCC as previously described.

EXAMPLE 27: The Induction of Bone Nodule Formation and Mineralization In
Vitro With pHis-5' ATG LMP-1s Expression Vector

Rat calvarial cells were isolated and grown in secondary culture
according to Example 1. Cultures were either unstimulated or stimulated with
glucocorticoid (GC) according to Example 1. The cultures were transfected
with 3 µg of recombinant pHis-A vector DNA/well as described in Example 25.
Mineralized nodules were visualized by Von Kossa staining according to
Example 3.

Human LMP-1s gene product overexpression alone (i.e., without GC
stimulation) induced significant bone nodule formation (~203 nodules/well) in
vitro. This is approximately 50% of the amount of nodules produced by cells
exposed to the GC positive control (~412 nodules/well). Similar results were
obtained with cultures transfected with pHisA-LMP-Rat expression vector
(~152 nodules/well) and pCMV2/LMP-Rat-Fwd (~206 nodules/well). In
contrast, the negative control pCMV2/LMP-Rat-Rev yielded ~2 nodules/well,
while approximately 4 nodules/well were seen in the untreated plates. These
data demonstrate that the human LMP-1 cDNA was at least as osteoinductive
as the rat LMP-1 cDNA in this model system. The effect in this experiment was
less than that observed with GC stimulation, but in some experiments the
effect was comparable.

EXAMPLE 28: LMP Induces Secretion of a Soluble Osteoinductive Factor 

Overexpression of RLMP-1 or HLMP-1s in rat calvarial osteoblast
cultures as described in Example 24 resulted in significantly greater nodule
formation than was observed in the negative control. To study the mechanism
of action of LIM mineralization protein, conditioned medium was harvested at
different time points, concentrated to 10X, sterile filtered, diluted to its original
concentration in medium containing fresh serum, and applied for four days to
untransfected cells.

Conditioned media harvested from cells transfected with RLMP-1 or
HLMP-1s at day 4 were approximately as effective in inducing nodule formation
as direct overexpression of RLMP-1 in transfected cells. Conditioned media
from cells transfected with RLMP-1 or HLMP-1 in the reverse orientation had
no apparent effect on nodule formation. Nor did conditioned media harvested
from LMP-1 transfected cultures before day 4 induce nodule formation. These
data suggest that expression of LMP-1 caused the synthesis and/or secretion
of a soluble factor, which did not appear in culture medium in effective amounts
until 4 days post transfection.

Since overexpression of rLMP-1 resulted in the secretion of an
osteoinductive factor into the medium, Western blot analysis was used to
determine if LMP-1 protein was present in the medium. The presence of rLMP-1
protein was assessed using antibody specific for LMP-1 (QDPDEE) and
detected by conventional means. LMP-1 protein was found only in the cell
layer of the culture and not detected in the medium.

Partial purification of the osteoinductive soluble factor was accomplished
by standard 25% and 100% ammonium sulfate cuts followed by DE-52 anion
exchange batch chromatography (100 mM or 500 mM NaCl). All activity was
observed in the high ammonium sulfate, high NaCl fractions. Such localization
is consistent with the possibility of a single factor being responsible for
conditioning the medium.

EXAMPLE 29: Gene Therapy In Lumbar Spine Fusion Mediated by Low Dose 
Adenovirus 



This study determined the optimal dose of adenoviral delivery of the
LMP-1 cDNA (SEQ ID NO: 2) to promote spine fusion in normal, that is,
immune competent, rabbits.

A replication-deficient human recombinant adenovirus was constructed
with the LMP-1 cDNA (SEQ ID NO: 2) driven by a CMV promoter using the
Adeno-Quest™ Kit (Quantum Biotechnologies, Inc., Montreal). A commercially
available (Quantum Biotechnologies, Inc., Montreal) recombinant adenovirus
containing the beta-galactosidase gene was used as a control.

Initially, an in vitro dose response experiment was performed to
determine the optimal concentration of adenovirus-delivered LMP-1
("AdV-LMP-1") to induce bone differentiation in rat calvarial osteoblast cultures using
a 60-minute transduction with a multiplicity of infection ("MOI") of 0.025, 0.25,
2.5, or 25 plaque-forming units (pfu) of virus per cell. Positive control cultures
were differentiated by a 7-day exposure to 10^-9 M glucocorticoid ("GC").
Negative control cultures were left untreated. On day 14, the number of
mineralized bone nodules was counted after von Kossa staining of the
cultures, and the level of osteocalcin secreted into the medium (pmol/mL) was
measured by radioimmunoassay (mean ± SEM).

The results of this experiment are shown in Table I. Essentially no
spontaneous nodules formed in the untreated negative control cultures. The
data show that a MOI equal to 0.25 pfu/cell is most effective for osteoinducing
bone nodules, achieving a level comparable to the positive control (GC).
Lower and higher doses of adenovirus were less effective.



TABLE I

                                          AdV-LMP-1 Dose (MOI)
Outcome         Neg. Ctrl.   GC         0.025      0.25       2.5        25
Bone Nodules    0.5±0.2      188±35     79.8±13    145.1±13   26.4±15    87.6±2
Osteocalcin     1.0±0.1      57.8±9     28.6±11    22.8±1     18.3±3     26.0±2

In vivo experiments were then performed to determine if the optimal in
vitro dose was capable of promoting intertransverse process spine fusions in
skeletally mature New Zealand white rabbits. Nine rabbits were anesthetized
and 3 cc of bone marrow was aspirated from the distal femur through the
intercondylar notch using an 18 gauge needle. The buffy coat was then
isolated, a 10-minute transduction with AdV-LMP-1 was performed, and the
cells were returned to the operating room for implantation. Single level
posterolateral lumbar spine arthrodesis was performed with decortication of
transverse processes and insertion of carrier (either rabbit devitalized bone
matrix or a collagen sponge) containing 8-15 million autologous nucleated
buffy coat cells transduced with either AdV-LMP-1 (MOI = 0.4) or AdV-BGal
(MOI = 0.4). Rabbits were euthanized after 5 weeks and spine fusions were
assessed by manual palpation, plain x-rays, CT scans, and undecalcified
histology.
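The dose figures in this example can be tied together with a small calculation: picking the most effective MOI from the Table I nodule counts, and converting the in vivo MOI into a total number of plaque-forming units for the stated cell numbers. The sketch below uses only the mean values from Table I (error terms are ignored) and the definition of MOI as pfu per cell; the function and variable names are hypothetical.

```python
# Mean bone nodule counts transcribed from Table I (SEM omitted).
GC_NODULES = 188.0
NODULES_BY_MOI = {0.025: 79.8, 0.25: 145.1, 2.5: 26.4, 25.0: 87.6}

# The most effective dose is the MOI with the highest mean nodule count.
best_moi = max(NODULES_BY_MOI, key=NODULES_BY_MOI.get)
fraction_of_gc = NODULES_BY_MOI[best_moi] / GC_NODULES


def pfu_required(moi, cell_count):
    """Total plaque-forming units needed for a given MOI (pfu per cell)."""
    return moi * cell_count


# In vivo transduction: MOI = 0.4 applied to 8-15 million buffy coat cells.
IN_VIVO_MOI = 0.4
low_pfu = pfu_required(IN_VIVO_MOI, 8_000_000)
high_pfu = pfu_required(IN_VIVO_MOI, 15_000_000)

print(f"Most effective MOI in vitro: {best_moi}")
print(f"Nodules relative to GC control: {fraction_of_gc:.0%}")
print(f"Virus needed in vivo: {low_pfu:.1e} to {high_pfu:.1e} pfu")
```

On these figures the 0.25 pfu/cell dose gives about 77% of the GC nodule count, and the in vivo dose corresponds to roughly 3.2 to 6 million pfu per implant.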

The spine fusion sites that received AdV-LMP-1 formed solid,
continuous spine fusion masses in all nine rabbits. In contrast, the sites
receiving AdV-BGal, or a lower dose of AdV-LMP-1 (MOI = 0.04), made little or
no bone and resulted in spine fusion at a rate comparable to the carrier alone
(< 40%). These results were consistent as evaluated by manual palpation, CT
scan, and histology. Plain radiographs, however, sometimes overestimated
the amount of bone that was present, especially in the control sites. LMP-1
cDNA delivery and bone induction were successful with both of the carrier
materials tested. There was no evidence of systemic or local immune
response to the adenovirus vector.

These data demonstrate consistent bone induction in a previously
validated rabbit spine fusion model which is quite challenging. Furthermore,
the protocol of using autogenous bone marrow cells with intraoperative ex vivo
gene transduction (10 minutes) is a more clinically feasible procedure than
other methods that call for overnight transduction or cell expansion for weeks in
culture. In addition, the most effective dose of recombinant adenovirus
(MOI = 0.25) was substantially lower than doses reported in other gene therapy
applications (MOI 40-500). We believe this is due to the fact that LMP-1 is an
intracellular signaling molecule and may have powerful signal amplification
cascades. Moreover, the observation that the same concentration of
AdV-LMP-1 that induced bone in cell culture was effective in vivo was also
surprising given the usual required increase in dose of other growth factors
when translating from cell culture to animal experiments. Taken together,
these observations indicate that local gene therapy using adenovirus to deliver
the LMP-1 cDNA is possible and the low dose required will likely minimize the
negative effects of immune response to the adenovirus vector.

EXAMPLE 30: Use of Peripheral Venous Blood Nucleated Cells (Buffy Coat) 
for Gene Therapy With LMP-1 cDNA To Make Bone 



In four rabbits we performed spine fusion surgery as above (Example
29) except the transduced cells were the buffy coat from venous blood rather
than bone marrow. These cells were transfected with Adeno-LMP or
pHIS-LMP plasmid and had equivalent successful results as when bone marrow cells
were used. This discovery of using ordinary venous blood cells for gene
delivery makes gene therapy more feasible clinically since it avoids painful
marrow harvest under general anesthesia and yields two times more cells per
mL of starting material.

EXAMPLE 31: Isolation of Human LMP-1 Splice Variants 

Intron/Exon mRNA transcript splice variants are a relatively common
regulatory mechanism in signal transduction and cellular/tissue development.
Splice variants of various genes have been shown to alter protein-protein,
protein-DNA, protein-RNA, and protein-substrate interactions. Splice variants
may also control tissue specificity for gene expression allowing different forms
(and therefore functions) to be expressed in various tissues. Splice variants
are a common regulatory phenomenon in cells. It is possible that the LMP
splice variants may result in effects in other tissues such as nerve
regeneration, muscle regeneration, or development of other tissues.

To screen a human heart cDNA library for splice variants of the HLMP-1
sequence, a pair of PCR primers corresponding to sections of SEQ ID NO: 22
was prepared. The forward PCR primer, which was synthesized using
standard techniques, corresponds to nucleotides 35-54 of SEQ ID NO: 22. It
has the following sequence:

5' GAGCCGGCATCATGGATTCC 3' (SEQ ID NO: 35) 

The reverse PCR primer, which is the reverse complement of 
nucleotides 820-839 in SEQ ID NO: 22, has the following sequence: 

5' GCTGCCTGCACAATGGAGGT 3' (SEQ ID NO: 36) 
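Because the reverse primer is defined as a reverse complement, the sense-strand sequence it anneals opposite can be recovered by reverse-complementing the primer. The sketch below does this for SEQ ID NO: 36 and also reports the length and GC content of both primers. The helper names are hypothetical, and the check is made on the primer sequences alone, since SEQ ID NO: 22 is not reproduced in this section.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = seq.upper()
    return 100 * (seq.count("G") + seq.count("C")) / len(seq)


forward_primer = "GAGCCGGCATCATGGATTCC"  # SEQ ID NO: 35
reverse_primer = "GCTGCCTGCACAATGGAGGT"  # SEQ ID NO: 36

# Sense-strand stretch expected at nucleotides 820-839 of SEQ ID NO: 22.
sense_target = reverse_complement(reverse_primer)

print(f"Sense-strand target of reverse primer: {sense_target}")
for name, primer in (("forward", forward_primer), ("reverse", reverse_primer)):
    print(f"{name}: {len(primer)} nt, {gc_percent(primer):.0f}% GC")
```

The reverse primer corresponds to the sense-strand sequence ACCTCCATTGTGCAGGCAGC, and both primers are 20-mers with 60% GC content.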

The forward and reverse PCR primers were used to screen human heart
cDNA (Clontech, Cat No. 7404-1) for sequences similar to HLMP-1 by
standard techniques, using a cycling protocol of 94°C for 30 seconds, 64°C for
30 seconds, and 72°C for 1 minute, repeated 30 times and followed by a
10 minute incubation at 72°C. The amplified cDNA sequences were
gel-purified and submitted to the Emory DNA Sequence Core Facility for
sequencing. The clones were sequenced using standard techniques and the
sequences were examined with PCGENE (Intelligenetics; programs SEQIN
and NALIGN) to determine homology to SEQ ID NO: 22. Two homologous
nucleotide sequences with putative alternative splice sites compared to SEQ ID
NO: 22 were then translated to their respective protein products with
Intelligenetics' program TRANSL.
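The cycling protocol described above can be turned into a total programmed hold time. The sketch below simply sums the stated step durations; it ignores temperature ramping between steps and any initial denaturation, neither of which is specified in the text, so the real run time on an instrument would be longer. The names are illustrative.

```python
# Cycling steps from the protocol: (temperature in C, duration in seconds).
CYCLE_STEPS = [(94, 30), (64, 30), (72, 60)]
CYCLE_COUNT = 30
FINAL_EXTENSION_SECONDS = 10 * 60  # 10 minute incubation at 72 C

seconds_per_cycle = sum(duration for _, duration in CYCLE_STEPS)
total_seconds = CYCLE_COUNT * seconds_per_cycle + FINAL_EXTENSION_SECONDS
total_minutes = total_seconds / 60

print(f"Hold time per cycle: {seconds_per_cycle} s")
print(f"Total programmed hold time: {total_minutes:.0f} min")
```

The programmed hold time is 2 minutes per cycle, or 70 minutes in total including the final extension.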

One of these two novel human cDNA sequences (SEQ ID NO: 37) 
comprises 1456 bp: 

10 20 30 40 50 60
CGACGCAGAG CAGCGCCCTG GCCGGGCCAA GCAGGAGCCG GCATCATGGA TTCCTTCAAG
70 80 90 100 110 120

GTAGTGCTGG AGGGGCCAGC ACCTTGGGGC TTCCGGCTGC AAGGGGGCAA GGACTTCAAT 
130 140 150 160 170 180 

GTGCCCCTCT CCATTTCCCG GCTCACTCCT GGGGGCAAAG CGGCGCAGGC CGGAGTGGCC

190 200 210 220 230 240 

GTGGGTGACT GGGTGCTGAG CATCGATGGC GAGAATGCGG GTAGCCTCAC ACACATCGAA 
250 260 270 280 290 300 

GCTCAGAACA AGATCCGGGC CTGCGGGGAG CGCCTCAGCC TGGGCCTCAG CAGGGCCCAG 
310 320 * 330 340 * 350 360

CCGGTTCAGA GCAAACCGCA GAAG GTGCAG ACCCCTGACA A ACAGCCGCT CCGACCGCTG 
370 380 390 " 400 410 "* 420 

GTCCCAGATG CCAGCAAGCA GCGGCTGATG GAGAACACAG AGGACTGGCG GCCGCGGCCG 
430 440 450 460 470 480 

GGGACAGGCC AGTCGCGTTC CTTCCGCATC CTTGCCCACC TCACAGGCAC CGAGTTCATG

490 500 510 520 530 540 

CAAGACCCGG ATGAGGAGCA CCTGAAGAAA TCAAGCCAGG TGCCCAGGAC AGAAGCCCCA 
550 560 570 580 590 600 

GCCCCAGCCT CATCTACACC CCAGGAGCCC TGGCCTGGCC CTACCGCCCC CAGCCCTACC 
610 620 630 640 650 660

AGCCGCCCGC CCTGGGCTGT GGACCCTGCG TTTGCCGAGC GCTATGCCCC GGACAAAACG 
670 680 690 700 710 720 

AGCACAGTGC TGACCCGGCA CAGCCAGCCG GCCACGCCCA CGCCGCTGCA GAGCCGCACC 
730 740 750 760 770 780 

TCCATTGTGC AGGCAGCTGC CGGAGGGGTG CCAGGAGGGG GCAGCAACAA CGGCAAGACT

790 800 810 820 830 840 

CCCGTGTGTC ACCAGTGCCA CAAGGTCATC CGGGGCCGCT ACCTGGTGGC GTTGGGCCAC 
850 860 870 880 890 900
GCGTACCACC CGGAGGAGTT TGTGTGTAGC CAGTGTGGGA AGGTCCTGGA AGAGGGTGGC
910 920 930 940 950 960 

TTCTTTGAGG AGAAGGGCGC CATCTTCTGC CCACCATGCT ATGACGTGCG CTATGCACCC 
970 980 990 1000 1010 1020 

AGCTGTGCCA AGTGCAAGAA GAAGATTACA GGCGAGATCA TGCACGCCCT GAAGATGACC

1030 1040 1050 1060 1070 1080 

TGGCACGTGC ACTGCTTTAC CTGTGCTGCC TGCAAGACGC CCATCCGGAA CAGGGCCTTC 
1090 1100 1110 1120 1130 1140 

TACATGGAGG AGGGCGTGCC CTATTGCGAG CGAGACTATG AGAAGATGTT TGGCACGAAA 
1150 1160 1170 1180 1190 1200

TGCCATGGCT GTGACTTCAA GATCGACGCT GGGGACCGCT TCCTGGAGGC CCTGGGCTTC 



1210 1220 1230 1240 1250 1260 

AGCTGGCATG ACACCTGCTT CGTCTGTGCG ATATGTCAGA TCAACCTGGA AGGAAAGACC 

1270 1280 1290 1300 1310 1320 

TTCTACTCCA AGAAGGACAG GCCTCTCTGC AAGAG CCATG CCTTCTCTCA TGTGTGAGCC

1330 1340 1350 1360 1370 1380 

CCTTCTGCCC ACAGCTGCCG CGGTGGCCCC TAGCCTGAGG GGCCTGGAGT CGTGGCCCTG 

1390 1400 1410 1420 1430 1440 

CATTTCTGGG TAGGGCTGGC AATGGTTGCC TTAACCCTGG CTCCTGGCCC GAGCCTGGGC 
1450

TCCCGGGCCC TGCCCA 



Reading frame shifts caused by the deletion of a 119 bp fragment
(between *) and the addition of a 17 bp fragment (underlined) result in a
truncated gene product having the following derived amino acid sequence
(SEQ ID NO: 38):

10 20 30 40 50 60
MDSFKVVLEG PAPWGFRLQG GKDFNVPLSI SRLTPGGKAA QAGVAVGDWV LSIDGENAGS
70 80 90 100 110 120

LTHIEAQNKI RACGERLSLG LSRAQPVQNK PQKVQTPDKQ PLRPLVPDAS KQRLMENTED
130 140 150 160 170 180

WRPRPGTGQS RSFRILAHLT GTEFMQDPDE EHLKKSSQVP RTEAPAPASS TPQEPWPGPT
190 200 210 220 230 240

APSPTSRPPW AVDPAFAERY APDKTSTVLT RHSQPATPTP LQSRTSIVQA AAGGVPGGGS
250 260 270 280 290 300

NNGKTPVCHQ CHQVIRARYL VALGHAYHPE EFVCSQCGKV LEEGGFFEEK GAIFCPPCYD
310 320 330 340 350 360

VRYAPSCAKC KKKITGEIMH ALKMTWHVLC FTCAACKTPI RNRAFYMEEG VPYCERDYEK
370 380 390 400 410 420

MFGTKCQWCD FKIDAGDRFL EALGFSWHDT CFVCAICQIN LEGKTFYSKK DRPLCKSHAF
SHV

This 423 amino acid protein demonstrates 100% homology to the
protein shown in SEQ ID NO: 10, except for the sequence in the
highlighted area (amino acids 94-99), which is due to the nucleotide changes
depicted above.

The second novel human heart cDNA sequence (SEQ ID NO: 39)
comprises 1575 bp:



10 20 30 40 50 60 

CGACGCAGAG CAGCGCCCTG GCCGGGCCAA GCAGGAGCCG GCATCATGGA TTCCTTCAAG 
70 80 90 100 110 120 

GTAGTGCTGG AGGGGCCAGC ACCTTGGGGC TTCCGGCTGC AAGGGGGCAA GGACTTCAAT

130 140 150 160 170 180 

GTGCCCCTCT CCATTTCCCG GCTCACTCCT GGGGGCAAAG CGGCGCAGGC CGGAGTGGCC 
190 200 210 220 230 240 
GTGGGTGACT GGGTGCTGAG CATCGATGGC GAGAATGCGG GTAGCCTCAC ACACATCGAA
250 260 270 280 290 300 

GCTCAGAACA AGATCCGGGC CTGCGGGGAG CGCCTCAGCC TGGGCCTCAG CAGGGCCCAG 
310 320 330 340 350 360 

CCGGTTCAGA GCAAACCGCA GAAGGCCTCC GCCCCCGCCG CGGACCCTCC GCGGTACACC

370 380 390 400 410 420 

TTTGCACCCA GCGTCTCCCT CAACAAGACG GCCCGGCCCT TTGGGGCGCC CCCGCCCGCT 
430 440 450 460 470 480 

GACAGCGCCC CGCAACAGAA TGGGTGCAGA CCCCTGACAA ACAGCCGCTC CGACCGCTGG
490 500 510 520 530 540

TCCCAGATGC CAGCAAGCAG CGGCTGATGG AGAACACAGA GGACTGGCGG CCGCGGCCGG 
550 560 570 580 590 600

GGACAGGCCA GTCGCGTTCC TTCCGCATCC TTGCCCACCT CACAGGCACC GAGTTCATGC 
610 620 630 640 650 660

AAGACCCGGA TGAGGAGCAC CTGAAGAAAT CAAGCCAGGT GCCCAGGACA GAAGCCCCAG

670 680 690 700 710 720 

CCCCAGCCTC ATCTACACCC CAGGAGCCCT GGCCTGGCCC TACCGCCCCC AGCCCTACCA 
730 740 750 760 770 780 

GCCGCCCGCC CTGGGCTGTG GACCCTGCGT TTGCCGAGCG CTATGCCCCG GACAAAACGA 
790 800 810 820 830 840

GCACAGTGCT GACCCGGCAC AGCCAGCCGG CCACGCCCAC GCCGCTGCAG AGCCGCACCT 
850 860 870 880 890 900 

CCATTGTGCA GGCAGCTGCC GGAGGGGTGC CAGGAGGGGG CAGCAACAAC GGCAAGACTC 
910 920 930 940 950 960 

CCGTGTGTCA CCAGTGCCAC AAGGTCATCC GGGGCCGCTA CCTGGTGGCG TTGGGCCACG

970 980 990 1000 1010 1020 

CGTACCACCC GGAGGAGTTT GTGTGTAGCC AGTGTGGGAA GGTCCTGGAA GAGGGTGGCT 
1030 1040 1050 1060 1070 1080
TCTTTGAGGA GAAGGGCGCC ATCTTCTGCC CACCATGCTA TGACGTGCGC TATGCACCCA

1090 1100 1110 1120 1130 1140 

GCTGTGCCAA GTGCAAGAAG AAGATTACAG GCGAGATCAT GCACGCCCTG AAGATGACCT 

1150 1160 1170 1180 1190 1200 

GGCACGTGCA CTGCTTTACC TGTGCTGCCT GCAAGACGCC CATCCGGAAC AGGGCCTTCT

1210 1220 1230 1240 1250 1260 

ACATGGAGGA GGGCGTGCCC TATTGCGAGC GAGACTATGA GAAGATGTTT GGCACGAAAT 

1270 1280 1290 1300 1310 1320 

GCCATGGCTG TGACTTCAAG ATCGACGCTG GGGACCGCTT CCTGGAGGCC CTGGGCTTCA 
1330 1340 1350 1360 1370 1380

GCTGGCATGA CACCTGCTTC GTCTGTGCGA TATGTCAGAT CAACCTGGAA GGAAAGACCT 

1390 1400 1410 1420 1430 1440 

TCTACTCCAA GAAGGACAGG CCTCTCTGCA AGAGCCATGC CTTCTCTCAT GTGTGAGCCC 

1450 1460 1470 1480 1490 1500 

CTTCTGCCCA CAGCTGCCGC GGTGGCCCCT AGCCTGAGGG GCCTGGAGTC GTGGCCCTGC

1510 1520 1530 1540 1550 1560 

ATTTCTGGGT AGGGCTGGCA ATGGTTGCCT TAACCCTGGC TCCTGGCCCG AGCCTGGGCT 

1570 

CCCGGGCCCT GCCCA 

A reading frame shift caused by the addition of a 17 bp fragment
(bolded, italicized and underlined) results in an early translation stop codon at
position 505-507 (underlined).
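The location of the early stop codon can be checked directly against the printed sequence. The following Python sketch is illustrative only and is not part of the original disclosure. It assumes that translation begins at the first ATG in the cDNA and that the standard genetic code applies. The names `CODON_TABLE`, `SEQ39_PREFIX` and `translate_first_orf` are hypothetical, and only the first 510 nucleotides of SEQ ID NO: 39 are copied in.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Nucleotides 1-510 of SEQ ID NO: 39, copied from the listing above
# (whitespace between the blocks of ten is removed by split/join).
SEQ39_PREFIX = "".join("""
CGACGCAGAG CAGCGCCCTG GCCGGGCCAA GCAGGAGCCG GCATCATGGA TTCCTTCAAG
GTAGTGCTGG AGGGGCCAGC ACCTTGGGGC TTCCGGCTGC AAGGGGGCAA GGACTTCAAT
GTGCCCCTCT CCATTTCCCG GCTCACTCCT GGGGGCAAAG CGGCGCAGGC CGGAGTGGCC
GTGGGTGACT GGGTGCTGAG CATCGATGGC GAGAATGCGG GTAGCCTCAC ACACATCGAA
GCTCAGAACA AGATCCGGGC CTGCGGGGAG CGCCTCAGCC TGGGCCTCAG CAGGGCCCAG
CCGGTTCAGA GCAAACCGCA GAAGGCCTCC GCCCCCGCCG CGGACCCTCC GCGGTACACC
TTTGCACCCA GCGTCTCCCT CAACAAGACG GCCCGGCCCT TTGGGGCGCC CCCGCCCGCT
GACAGCGCCC CGCAACAGAA TGGGTGCAGA CCCCTGACAA ACAGCCGCTC CGACCGCTGG
TCCCAGATGC CAGCAAGCAG CGGCTGATGG
""".split())


def translate_first_orf(dna):
    """Translate from the first ATG (an assumption) to the first in-frame stop.

    Returns (protein, 1-based start position, 1-based (first, last) position
    of the stop codon, or None if no stop codon is reached).
    """
    start = dna.find("ATG")
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            return "".join(protein), start + 1, (i + 1, i + 3)
        protein.append(aa)
    return "".join(protein), start + 1, None


protein, start, stop = translate_first_orf(SEQ39_PREFIX)
print("start codon at nucleotide", start)
print("stop codon at nucleotides", stop)
print("protein length", len(protein))
print(protein)
```

The printed protein length can be compared with the 153 amino acids stated for SEQ ID NO: 40.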

The derived amino acid sequence (SEQ ID NO: 40) consists of 153 
amino acids: 

10 20 30 40 50 60
MDSFKVVLEG PAPWGFRLQG GKDFNVPLSI SRLTPGGKAA QAGVAVGDWV LSIDGENAGS

70 80 90 100 110 120
LTHIEAQNKI RACGERLSLG LSRAQPVQSK PQKASAPAAD PPRYTFAPSV SLNKTARPFG

130 140 150
APPPADSAPQ QNGCRPLTNS RSDRWSQMPA SSG

This protein demonstrates 100% homology to SEQ ID NO: 10 until
amino acid 94, where the addition of the 17 bp fragment depicted in the
nucleotide sequence results in a frame shift. Over amino acids 94-153, the
protein is not homologous to SEQ ID NO: 10. Amino acids 154-457 in SEQ ID
NO: 10 are not present due to the early stop codon depicted in the nucleotide
sequence.
EXAMPLE 32: Genomic HLMP-1 Nucleotide Sequence

Applicants have identified the genomic DNA sequence encoding HLMP-1,
including putative regulatory elements associated with HLMP-1 expression.
The entire genomic sequence is shown in SEQ ID NO: 41. This sequence
was derived from AC023788 (clone RP11-564G9), Genome Sequencing
Center, Washington University School of Medicine, St. Louis, MO.

The putative promoter region for HLMP-1 spans nucleotides 2,660-8,733
in SEQ ID NO: 41. This region comprises, among other things, at least
ten potential glucocorticoid response elements ("GREs") (nucleotides 6148-6153,
6226-6231, 6247-6252, 6336-6341, 6510-6515, 6552-6557, 6727-6732,
6752-6757, 7738-7743, and 8255-8260), twelve potential Sma-2 homologues
to Mothers against Drosophila decapentaplegic ("SMAD") binding element
sites (nucleotides 3569-3575, 4552-4558, 4582-4588, 5226-5232, 6228-6234,
6649-6655, 6725-6731, 6930-6936, 7379-7384, 7738-7742, 8073-8079, and
8378-8384), and three TATA boxes (nucleotides 5910-5913, 6932-6935, and
7380-7383). The three TATA boxes, all of the GREs, and eight of the SMAD
binding elements ("SBEs") are grouped in the region spanning nucleotides
5,841-8,733 in SEQ ID NO: 41. These regulatory elements can be used, for
example, to regulate expression of exogenous nucleotide sequences encoding
proteins involved in the process of bone formation. This would permit systemic
administration of therapeutic factors or genes relating to bone formation and
repair, as well as factors or genes associated with tissue differentiation and
development.
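The grouping of regulatory elements described above can be tallied with a short script. The following Python sketch is illustrative only and is not part of the original disclosure. It copies the coordinates listed in this example and counts the elements that lie wholly inside nucleotides 5,841-8,733 of SEQ ID NO: 41. The names `REGION`, `GRES`, `SBES`, `TATA_BOXES` and `within` are hypothetical.

```python
# Coordinates (start, end) in SEQ ID NO: 41, copied from the text above.
REGION = (5841, 8733)

GRES = [(6148, 6153), (6226, 6231), (6247, 6252), (6336, 6341), (6510, 6515),
        (6552, 6557), (6727, 6732), (6752, 6757), (7738, 7743), (8255, 8260)]

SBES = [(3569, 3575), (4552, 4558), (4582, 4588), (5226, 5232), (6228, 6234),
        (6649, 6655), (6725, 6731), (6930, 6936), (7379, 7384), (7738, 7742),
        (8073, 8079), (8378, 8384)]

TATA_BOXES = [(5910, 5913), (6932, 6935), (7380, 7383)]


def within(region, elements):
    """Return the elements whose start and end both fall inside the region."""
    lo, hi = region
    return [(s, e) for (s, e) in elements if lo <= s and e <= hi]


for name, elements in (("GRE", GRES), ("SBE", SBES), ("TATA", TATA_BOXES)):
    inside = within(REGION, elements)
    print(f"{name}: {len(inside)} of {len(elements)} within {REGION[0]}-{REGION[1]}")
```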

In addition to the putative regulatory elements, 13 exons corresponding
to the nucleotide sequence encoding HLMP-1 have been identified. These
exons span the following nucleotides in SEQ ID NO: 41:



Exon 1      8733-8767
Exon 2      9790-9895
Exon 3      13635-13787
Exon 4      13877-13907
Exon 5      14387-14502
Exon 6      15161-15297
Exon 7      15401-15437
Exon 8      16483-16545
Exon 9      16689-16923
Exon 10     18068-18248
Exon 11     22117-22240
Exon 12     22323-22440
Exon 13     22575-22911



In HLMP-2 there is another exon (Exon 5A), which spans nucleotides
14887-14904.
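The exon coordinates can be converted to exon lengths with simple arithmetic. The following Python sketch is illustrative only and is not part of the original disclosure. It assumes that both end coordinates are inclusive, so that a span a-b covers b - a + 1 nucleotides. The names `EXONS`, `EXON_5A` and `span_length` are hypothetical.

```python
# Exon coordinates in SEQ ID NO: 41, copied from the list above.
EXONS = {
    1: (8733, 8767), 2: (9790, 9895), 3: (13635, 13787), 4: (13877, 13907),
    5: (14387, 14502), 6: (15161, 15297), 7: (15401, 15437), 8: (16483, 16545),
    9: (16689, 16923), 10: (18068, 18248), 11: (22117, 22240),
    12: (22323, 22440), 13: (22575, 22911),
}
EXON_5A = (14887, 14904)  # additional exon reported for HLMP-2


def span_length(span):
    """Length of a span, assuming both coordinates are inclusive."""
    start, end = span
    return end - start + 1


lengths = {n: span_length(s) for n, s in EXONS.items()}
for n, length in lengths.items():
    print(f"Exon {n}: {length} nt")
print("Total of exons 1-13:", sum(lengths.values()), "nt")
print("Exon 5A:", span_length(EXON_5A), "nt")
```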

All cited publications and patents are hereby incorporated by reference 
in their entirety. 

While the foregoing specification teaches the principles of the present
invention, with examples provided for the purpose of illustration, it will be
appreciated by one skilled in the art from reading this disclosure that various
changes in form and detail can be made without departing from the true scope
of the invention.
reference number 6148.0222-00304
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 17, line 20

B. IDENTIFICATION OF DEPOSIT
Further deposits are identified on an additional sheet [X]

Name of depositary institution
American Type Culture Collection

Address of depositary institution (including postal code and country)
10801 University Blvd.
Manassas, VA 20110-2209

Date of deposit: 14 April 2000
Accession Number: PTA-1698

C. ADDITIONAL INDICATIONS (leave blank if not applicable)
This information is continued on an additional sheet [ ]

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)
The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")

For receiving Office use only
[ ] This sheet was received with the international application
Authorized officer

For International Bureau use only
[ ] This sheet was received by the International Bureau on: 17 JUL 2000
Authorized officer
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 17, line 21

B. IDENTIFICATION OF DEPOSIT
Further deposits are identified on an additional sheet [ ]

Name of depositary institution
American Type Culture Collection

Address of depositary institution (including postal code and country)
10801 University Blvd.
Manassas, VA 20110-2209

Date of deposit: 14 April 2000
Accession Number: PTA-1699

C. ADDITIONAL INDICATIONS (leave blank if not applicable)
This information is continued on an additional sheet [ ]

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)
The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications, e.g.,
"Accession Number of Deposit")

For receiving Office use only
[ ] This sheet was received with the international application
Authorized officer

For International Bureau use only
[ ] This sheet was received by the International Bureau on: 17 JUL 2000
Authorized officer

Form PCT/RO/134 (July 1992)
We claim:

1. An isolated nucleic acid molecule comprising SEQ ID NO: 37 or
SEQ ID NO: 39.

2. An isolated human LMP protein encoded by SEQ ID NO: 37 or SEQ
ID NO: 39.

3. A vector comprising the isolated nucleic acid molecule of claim 1.

4. A host cell comprising the vector of claim 3.

5. The host cell of claim 4, wherein the host cell is selected from the
group consisting of prokaryotic cells, yeast cells and mammalian
cells.

6. The isolated nucleic acid molecule of claim 1, further comprising a
label.

7. A human LIM mineralization protein comprising an amino acid
sequence selected from the group consisting of SEQ ID NO: 38 and SEQ ID
NO: 40.

8. A monoclonal antibody specific for HLMP-2 (SEQ ID NO: 38) or
HLMP-3 (SEQ ID NO: 40).

9. A method of inducing bone formation comprising transfecting
osteogenic precursor cells or peripheral blood leukocytes with an isolated
nucleic acid molecule comprising SEQ ID NO: 39.

10. The method of claim 9, wherein the isolated nucleic acid molecule is
in a vector.

11. The method of claim 10, wherein the vector is an expression vector.

12. The method of claim 11, wherein the vector is a plasmid.

13. The method of claim 11, wherein the vector is a virus.

14. The method of claim 13, wherein the virus is an adenovirus.

15. The method of claim 13, wherein the virus is a retrovirus.

16. The method of claim 9, wherein the osteogenic precursor cells or
peripheral blood leukocytes are transfected ex vivo.

17. The method of claim 9, wherein the osteogenic precursor cells are
transfected in vivo by direct injection of the isolated nucleic acid molecule.
18. The method of claim 9, wherein the LIM mineralization protein is
HLMP-3 (SEQ ID NO: 40).

19. A method of fusing a spine, comprising:

(a) transfecting osteogenic precursor cells or peripheral blood
leukocytes with an isolated nucleic acid molecule comprising SEQ ID NO: 39;

(b) admixing the transfected osteogenic precursor cells or peripheral
blood leukocytes with a matrix; and

(c) contacting the matrix with the spine;

wherein expression of the nucleotide sequence causes mineralized
bone formation in the matrix.

20. The method of claim 19, wherein the osteogenic precursor cells or
peripheral blood cells are transfected ex vivo.

21. A method of inducing systemic bone formation in a mammalian host
in need thereof, comprising:

(a) transfecting osteogenic precursor cells or peripheral blood
leukocytes with a vector that is stably maintained in the osteogenic precursor
cells or peripheral blood leukocytes, the vector comprising SEQ ID NO: 39
operably linked to a regulatable promoter, wherein the regulatable promoter
responds to an exogenous control compound; and

(b) administering to the host, as needed, an amount of the
exogenous control substance effective to cause expression of SEQ ID NO: 39.
22. A method of stimulating production of an osteogenic soluble factor
by an osteogenic cell, comprising:

(a) transfecting the osteogenic cell or peripheral blood leukocyte with
an isolated nucleic acid molecule comprising SEQ ID NO: 39; and

(b) overexpressing the isolated nucleic acid molecule.

23. An osteogenic soluble factor produced by the method of claim 22.

24. The osteogenic soluble factor of claim 23, wherein the osteogenic
factor is a protein.

25. A method of inhibiting the expression of HLMP-2 or HLMP-3
comprising transfecting a cell wherein HLMP-2 or HLMP-3 is expressed with an
antisense oligonucleotide.

26. The method of claim 17, wherein the isolated nucleic acid
molecule is in a vector selected from the group consisting of a plasmid and a
virus.

27. The method of claim 26, wherein the vector is a plasmid, which
plasmid is directly injected into muscle tissue.

28. A method of inhibiting bone formation comprising transfecting
osteogenic precursor cells or peripheral blood leukocytes with an isolated
nucleic acid molecule comprising SEQ ID NO: 37.

29. The method of claim 28, wherein the isolated nucleic acid molecule
is in a vector.

30. The method of claim 29, wherein the vector is an expression vector.

31. The method of claim 30, wherein the vector is a plasmid.

32. The method of claim 30, wherein the vector is a virus.

33. The method of claim 32, wherein the virus is an adenovirus.

34. The method of claim 32, wherein the virus is a retrovirus.

35. The method of claim 28, wherein the osteogenic precursor cells or
peripheral blood leukocytes are transfected ex vivo.

36. The method of claim 28, wherein the osteogenic precursor cells are
transfected in vivo by direct injection of the isolated nucleic acid molecule.

37. The method of claim 28, wherein the LIM mineralization protein is
HLMP-2 (SEQ ID NO: 38).
SEQUENCE LISTING



<110> Boden M.D., Scott D 

Hair Ph.D., Gregory A

<120> LIM Mineralization Protein Splice Variants 

<130> splice variant LMP 

<140> 6148.0222-00304 
<141> 2000-04-28 

<150> 60/132,021 
<151> 1999-04-30 

<160> 42 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 457 

<212> PRT

<213> Rattus norvegicus 

<400> 1 

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

Leu Ser Arg Ala Gln Pro Ala Gln Ser Lys Pro Gln Lys Ala Leu Thr
85 90 95

Pro Pro Ala Asp Pro Pro Arg Tyr Thr Phe Ala Pro Ser Ala Ser Leu
100 105 110

Asn Lys Thr Ala Arg Pro Phe Gly Ala Pro Pro Pro Thr Asp Ser Ala
115 120 125

Leu Ser Gln Asn Gly Gln Leu Leu Arg Gln Leu Val Pro Asp Ala Ser
130 135 140

Lys Gln Arg Leu Met Glu Asn Thr Glu Asp Trp Arg Pro Arg Pro Gly
145 150 155 160

Thr Gly Gln Ser Arg Ser Phe Arg Ile Leu Ala His Leu Thr Gly Thr
165 170 175

Glu Phe Met Gln Asp Pro Asp Glu Glu Phe Met Lys Lys Ser Ser Gln
180 185 190

Val Pro Arg Thr Glu Ala Pro Ala Pro Ala Ser Thr Ile Pro Gln Glu
195 200 205

Ser Trp Pro Gly Pro Thr Thr Pro Ser Pro Thr Ser Arg Pro Pro Trp
210 215 220

Ala Val Asp Pro Ala Phe Ala Glu Arg Tyr Ala Pro Asp Lys Thr Ser
225 230 235 240

Thr Val Leu Thr Arg His Ser Gln Pro Ala Thr Pro Thr Pro Leu Gln
245 250 255

Asn Arg Thr Ser Ile Val Gln Ala Ala Ala Gly Gly Gly Thr Gly Gly
260 265 270

Gly Ser Asn Asn Gly Lys Thr Pro Val Cys His Gln Cys His Lys Ile
275 280 285

Ile Arg Gly Arg Tyr Leu Val Ala Leu Gly His Ala Tyr His Pro Glu
290 295 300

Glu Phe Val Cys Ser Gln Cys Gly Lys Val Leu Glu Glu Gly Gly Phe
305 310 315 320

Phe Glu Glu Lys Gly Ala Ile Phe Cys Pro Ser Cys Tyr Asp Val Arg
325 330 335

Tyr Ala Pro Ser Cys Ala Lys Cys Lys Lys Lys Ile Thr Gly Glu Ile
340 345 350

Met His Ala Leu Lys Met Thr Trp His Val Pro Cys Phe Thr Cys Ala
355 360 365

Ala Cys Lys Thr Pro Ile Arg Asn Arg Ala Phe Tyr Met Glu Glu Gly
370 375 380

Ala Pro Tyr Cys Glu Arg Asp Tyr Glu Lys Met Phe Gly Thr Lys Cys
385 390 395 400

Arg Gly Cys Asp Phe Lys Ile Asp Ala Gly Asp Arg Phe Leu Glu Ala
405 410 415

Leu Gly Phe Ser Trp His Asp Thr Cys Phe Val Cys Ala Ile Cys Gln
420 425 430

Ile Asn Leu Glu Gly Lys Thr Phe Tyr Ser Lys Lys Asp Lys Pro Leu
435 440 445

Cys Lys Ser His Ala Phe Ser His Val
450 455
<210> 2
<211> 1696 
<212> DNA 

<213> Rattus norvegicus 
<400> 2 

gcacgaggat cccagcgcgg ctcctggagg ccgccaggca gccgcccagc cgggcattca 60
ggagcaggta ccatggattc cttcaaggta gtgctggagg gacctgcccc ttggggcttc 120
cgtctgcaag ggggcaagga cttcaacgtg cccctctcca tctctcggct cactcctgga 180
ggcaaggccg cacaggccgg tgtggccgtg ggagactggg tactgagtat cgacggtgag 240
aacgccggaa gcctcacaca cattgaagcc cagaacaaga tccgtgcctg tggggagcgc 300
ctcagcctgg gtcttagcag agcccagcct gctcagagca aaccacagaa ggccctgacc 360
cctcccgccg accccccgag gtacactttt gcaccaagcg cctccctcaa caagacggcc 420
cggcccttcg gggcaccccc acctactgac agcgccctgt cgcagaatgg acagctgctc 480
agacagctgg tccctgatgc cagcaagcag cggctgatgg agaatactga agactggcgc 540
ccgcggccag ggacaggcca gtcccgttcc ttccgcatcc ttgctcacct cacgggcaca 600
gagttcatgc aagacccgga tgaggaattc atgaagaagt caagccaggt gcccaggaca 660
gaagccccag ccccagcctc aaccataccc caggaatcct ggcctggccc caccaccccc 720

r*t/-*/~>rt/~>r*n~>-nr^/~t /~rr-rs*r r-> /~>/-ri- t~r o ei r* *- «n Tt ¥• *r *~ ir> f- <"r t> « *-r *~ ~ ~ — -I n r\ 
gacaaaacca gcacagtgct gacccgacac agccagccag ccacacctac gcctctgcag 840
aaccgcacct ccatagttca ggctgcagct ggagggggca caggaggagg cagcaacaat 900
ggcaagacgc ctgtatgcca ccagtgccac aagatcatcc gcggccgata cctggtagca 960 
ctgggccacg cgtaccatcc tgaggaattt gtgtgcagcc agtgtgggaa ggtcctggaa 1020 
gagggtggct tcttcgagga gaagggagct atcttttgcc cctcctgcta tgatgtgcgc 1080 
tatgcaccca gctgtgccaa atgcaagaag aagatcactg gagagatcat gcatgcgctg 1140 
aagatgacct ggcatgttcc ctgcttcacc tgtgcagcct gcaaaacccc tatccgcaac 1200 
agggctttct acatggagga gggggctccc tactgcgagc gagattacga gaagatgttt 1260 
ggcacaaagt gtcgcggctg tgacttcaag atcgatgccg gggaccgttt cctggaagcc 1320
ctgggtttca gctggcatga tacgtgtttt gtttgcgcaa tatgtcaaat caacttggaa 1380 
ggaaagacct tctactccaa gaaggacaag cccctgtgca agagccatgc cttttcccac 1440
gtatgagcac ctcctcacac tactgccacc ctactctgcc agaagggtga taaaatgaga 1500 
gagctctctc tccctcgacc tttctgggtg gggctggcag ccattgtcct agccttggct 1560 
cctggccaga tcctggggct ccctcctcac agtccccttt cccacacttc ctccaccacc 1620 
accaccgtca ctcacaggtg ctagcctcct agccccagtt cactctggtg tcacaataaa 1680 
cctgtatgta gctgtg 1696 



<210> 3 
<211> 260 
<212> DNA 

<213> Rattus norvegicus 
<400> 3 

ttctacatgg aggagggggc tccctactgc gagcgagatt acgagaagat gtttggcaca 
aagtgtcgcg gctgtgactt caagatcgat gccggggacc gtttcctgga agccctgggt 
ttcagctggc atgatacgtg ttttgtttgc gcaatatgtc aaatcaactt ggaaggaaag 
accttctact ccaagaagga caagcccctg tgcaagagcc atgccttttc ccacgtatga 
gcacctcctc acactactgc 



<210> 4 
<211> 16 
<212> DNA 
<213> MMLV 



<400> 4

aagctttttt tttttg 16 



<210> 5 
<211> 13 
<212> DNA 
<213> MMLV 

<400> 5 

aagcttggct atg 13 



<210> 6 

<211> 223 

<212> DNA 

<213> Homo sapiens 

<400> 6 

atccttgctc acctcacggg caccgagttc atgcaagacc cggatgagga gcacctgaag 60
aaatcaagcc aggtgcccag gacagaagcc ccagccccag cctcatctac accccaggag 120
ccctggcctg gccctaccgc ccccagccct accagccgcc cgccctgggc tgtggaccct 180
gcgtttgccg agcgctatgc cccagacaaa accagcacag tgc 223



<210> 7 

<211> 717 

<212> DNA 

<213> Homo sapiens 

<400> 7 

atggattcct tcaaggtagt gctggagggg ccagcacctt ggggcttccg gctgcaaggg 60
ggcaaggact tcaatgtgcc cctctccatt tcccggctca ctcctggggg caaagcggcg 120
caggccggag tggccgtggg tgactgggtg ctgagcatcg atggcgagaa tgcgggtagc 180
ctcacacaca tcgaagctca gaacaagatc cgggcctgcg gggagcgcct cagcctgggc 240
ctcagcaggg cccagccggt tcagagcaaa ccgcagaagg cctccgcccc cgccgcggac 300
cctccgcggt acacctttgc acccagcgtc tccctcaaca agacggcccg gccctttggg 360
gcgcccccgc ccgctgacag cgccccgcaa cagaatggac agccgctccg accgctggtc 420
ccagatgcca gcaagcagcg gctgatggag aacacagagg actggcggcc gcggccgggg 480
acaggccagt cgcgttcctt ccgcatcctt gcccacctca caggcaccga gttcatgcaa 540
gacccggatg aggagcacct gaagaaatca agccaggtgc ccaggacaga agccccagcc 600
ccagcctcat ctacacccca ggagccctgg cctggcccta ccgcccccag ccctaccagc 660
cgcccgccct gggctgtgga ccctgcgttt gccgagcgct atgccccgga caaaacg 717



<210> 8 

<211> 1488 

<212> DNA 

<213> Homo sapiens 

<400> 8 

atcgatggcg agaatgcggg tagcctcaca cacatcgaag ctcagaacaa gatccgggcc 60
tgcggggagc gcctcagcct gggcctcagc agggcccagc cggttcagag caaaccgcag 120
aaggcctccg cccccgccgc ggaccctccg cggtacacct ttgcacccag cgtctccctc 180
aacaagacgg cccggccctt tggggcgccc ccgcccgctg acagcgcccc gcaacagaat 240
ggacagccgc tccgaccgct ggtcccagat gccagcaagc agcggctgat ggagaacaca 300
gaggactggc ggccgcggcc ggggacaggc cagtcgcgtt ccttccgcat ccttgcccac 360
ctcacaggca ccgagttcat gcaagacccg gatgaggagc acctgaagaa atcaagccag 420
gtgcccagga cagaagcccc agccccagcc tcatctacac cccaggagcc ctggcctggc 480
cctaccgccc ccagccctac cagccgcccg ccctgagctg tggaccctgc gtttgccgag 540
cgctatgccc cggacaaaac gagcacagtg ctgacccggc acagccagcc ggccacgccc 600
acgccgctgc agagccgcac ctccattgtg caggcagctg ccggaggggt gccaggaggg 660
ggcagcaaca acggcaagac tcccgtgtgt caccagtgcc acaaggtcat ccggggccgc 720
tacctggtgg cgttgggcca cgcgtaccac ccggaggagt ttgtgtgtag ccagtgtggg 780
aaggtcctgg aagagggtgg cttctttgag gagaagggcg ccatcttctg cccaccatgc 840
tatgacgtgc gctatgcacc cagctgtgcc aagtgcaaga agaagattac aggcgagatc 900
atgcacgccc tgaagatgac ctggcacgtg cactgcttta cctgtgctgc ctgcaagacg 960
cccatccgga acagggcctt ctacatggag gagggcgtgc cctattgcga gcgagactat 1020
gagaagatgt ttggcacgaa atgccatggc tgtgacttca agatcgacgc tggggaccgc 1080
ttcctggagg ccctgggctt cagctggcat gacacctgct tcgtctgtgc gatatgtcag 1140
atcaacctgg aaggaaagac cttctactcc aagaaggaca ggcctctctg caagagccat 1200
gccttctctc atgtgtgagc cccttctgcc cacagctgcc gcggtggccc ctagcctgag 1260
gggcctggag tcgtggccct gcatttctgg gtagggctgg caatggttgc cttaaccctg 1320
gctcctggcc cgagcctggg ctcccgggcc cctgcccacc caccttatcc tcccacccca 1380
ctccctccac caccacagca caccggtgct ggccacacca gccccctttc acctccagtg 1440
ccacaataaa cctgtaccca gctgaattcc aaaaaatcca aaaaaaaa 1488



<210> 9 
<211> 1644 
<212> DNA 

<213> Homo sapiens 
<400> 9 

atggattcct tcaaggtagt gctggagggg ccagcacctt ggggcttccg gctgcaaggg 60
ggcaaggact tcaatgtgcc cctctccatt tcccggctca ctcctggggg caaagcggcg 120
caggccggag tggccgtggg tgactgggtg ctgagcatcg atggcgagaa tgcgggtagc 180
ctcacacaca tcgaagctca gaacaagatc cgggcctgcg gggagcgcct cagcctgggc 240
ctcagcaggg cccagccggt tcagagcaaa ccgcagaagg cctccgcccc cgccgcggac 300
cctccgcggt acacctttgc acccagcgtc tccctcaaca agacggcccg gccctttggg 360
gcgcccccgc ccgctgacag cgccccgcaa cagaatggac agccgctccg accgctggtc 420
ccagatgcca gcaagcagcg gctgatggag aacacagagg actggcggcc gcggccgggg 480
acaggccagt cgcgttcctt ccgcatcctt gcccacctca caggcaccga gttcatgcaa 540
gacccggatg aggagcacct gaagaaatca agccaggtgc ccaggacaga agccccagcc 600
ccagcctcat ctacacccca ggagccctgg cctggcccta ccgcccccag ccctaccagc 660
cgcccgccct gggctgtgga ccctgcgttt gccgagcgct atgccccgga caaaacgagc 720
acagtgctga cccggcacag ccagccggcc acgcccacgc cgctgcagag ccgcacctcc 780
attgtgcagg cagctgccgg aggggtgcca ggagggggca gcaacaacgg caagactccc 840
gtgtgtcacc agtgccacaa ggtcatccgg ggccgctacc tggtggcgtt gggccacgcg 900
taccacccgg aggagtttgt gtgtagccag tgtgggaagg tcctggaaga gggtggcttc 960
tttgaggaga agggcgccat cttctgccca ccatgctatg acgtgcgcta tgcacccagc 1020
tgtgccaagt gcaagaagaa gattacaggc gagatcatgc acgccctgaa gatgacctgg 1080
cacgtgcact gctttacctg tgctgcctgc aagacgccca tccggaacag ggccttctac 1140
atggaggagg gcgtgcccta ttgcgagcga gactatgaga agatgtttgg cacgaaatgc 1200
catggctgtg acttcaagat cgacgctggg gaccgcttcc tggaggccct gggcttcagc 1260
tggcatgaca cctgcttcgt ctgtgcgata tgtcagatca acctggaagg aaagaccttc 1320
tactccaaga aggacaggcc tctctgcaag agccatgcct tctctcatgt gtgagcccct 1380
tctgcccaca gctgccgcgg tggcccctag cctgaggggc ctggagtcgt ggccctgcat 1440
ttctgggtag ggctggcaat ggttgcctta accctggctc ctggcccgag cctgggctcc 1500
cgggcccctg cccacccacc ttatcctccc accccactcc ctccaccacc acagcacacc 1560
ggtgctggcc acaccagccc cctttcacct ccagtgccac aataaacctg tacccagctg 1620
aattccaaaa aatccaaaaa aaaa 1644
<210> 10

<211> 457 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

Leu Ser Arg Ala Gln Pro Val Gln Ser Lys Pro Gln Lys Ala Ser Ala
85 90 95

Pro Ala Ala Asp Pro Pro Arg Tyr Thr Phe Ala Pro Ser Val Ser Leu
100 105 110

Asn Lys Thr Ala Arg Pro Phe Gly Ala Pro Pro Pro Ala Asp Ser Ala
115 120 125

Pro Gln Gln Asn Gly Gln Pro Leu Arg Pro Leu Val Pro Asp Ala Ser
130 135 140

Lys Gln Arg Leu Met Glu Asn Thr Glu Asp Trp Arg Pro Arg Pro Gly
145 150 155 160

Thr Gly Gln Ser Arg Ser Phe Arg Ile Leu Ala His Leu Thr Gly Thr
165 170 175

Glu Phe Met Gln Asp Pro Asp Glu Glu His Leu Lys Lys Ser Ser Gln
180 185 190

Val Pro Arg Thr Glu Ala Pro Ala Pro Ala Ser Ser Thr Pro Gln Glu
195 200 205

Pro Trp Pro Gly Pro Thr Ala Pro Ser Pro Thr Ser Arg Pro Pro Trp
210 215 220

Ala Val Asp Pro Ala Phe Ala Glu Arg Tyr Ala Pro Asp Lys Thr Ser
225 230 235 240

Thr Val Leu Thr Arg His Ser Gln Pro Ala Thr Pro Thr Pro Leu Gln
245 250 255

Ser Arg Thr Ser Ile Val Gln Ala Ala Ala Gly Gly Val Pro Gly Gly
260 265 270

Gly Ser Asn Asn Gly Lys Thr Pro Val Cys His Gln Cys His Lys Val
275 280 285

Ile Arg Gly Arg Tyr Leu Val Ala Leu Gly His Ala Tyr His Pro Glu
290 295 300

Glu Phe Val Cys Ser Gln Cys Gly Lys Val Leu Glu Glu Gly Gly Phe
305 310 315 320

Phe Glu Glu Lys Gly Ala Ile Phe Cys Pro Pro Cys Tyr Asp Val Arg
325 330 335

Tyr Ala Pro Ser Cys Ala Lys Cys Lys Lys Lys Ile Thr Gly Glu Ile
340 345 350

Met His Ala Leu Lys Met Thr Trp His Val His Cys Phe Thr Cys Ala
355 360 365

Ala Cys Lys Thr Pro Ile Arg Asn Arg Ala Phe Tyr Met Glu Glu Gly
370 375 380

Val Pro Tyr Cys Glu Arg Asp Tyr Glu Lys Met Phe Gly Thr Lys Cys
385 390 395 400

His Gly Cys Asp Phe Lys Ile Asp Ala Gly Asp Arg Phe Leu Glu Ala
405 410 415

Leu Gly Phe Ser Trp His Asp Thr Cys Phe Val Cys Ala Ile Cys Gln
420 425 430

Ile Asn Leu Glu Gly Lys Thr Phe Tyr Ser Lys Lys Asp Arg Pro Leu
435 440 445

Cys Lys Ser His Ala Phe Ser His Val
450 455



<210> 11 
<211> 22 
<212> DNA 

<213> Rattus norvegicus 
<400> 11 

gccagggttt tcccagtcac ga 22 



<210> 12 
<211> 22 
<212> DNA 

<213> Rattus norvegicus 
<400> 12 

gccagggttt tcccagtcac ga 22 
<210> 13

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 13 

tcttagcaga gcccagcctg ct 22 



<210> 14 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 14 

gcatgaactc tgtgcccgtg ag 22 



<210> 15 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 



<400> 15 

atccttgctc acctcacggg 20 



<210> 16 
<211> 22 
<212> DNA 

<213> Rattus norvegicus 
<400> 16 

gcactgtgct ggttttgtct gg 22 



<210> 17 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 17 

catggattcc ttcaaggtag tgc 23 



<210> 18 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 18 

gttttgtctg gggcagagcg 20 



<210> 19 
<211> 44 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: adaptor for 
Marathon RACE reactions 

<400> 19 

ctaatacgac tcactatagg gctcgagcgg ccgcccgggc aggt 44 

<210> 20 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: PCR primer 
specific for Marathon RACE adaptor 

<400> 20 

ccatcctaat acgactcact atagggc 27

<210> 21 
<211> 765 
<212> DNA 

<213> Homo sapiens 
<400> 21 

ccgttgtttg taaaacgacg cagagcagcg ccctggccgg gccaagcagg agccggcatc 
atggattcct tcaaggtagt gctggagggg ccagcacctt ggggcttccg gctgcaaggg 
ggcaaggact tcaatgtgcc ctcctccatt tcccggctca cctctggggg caaggccgtg 
caggccggag tggccgtaag tgactgggtg ctgagcatcg atggcgagaa tgcgggtagc 
ctcacacaca tcgaagctca gaacaagatc cgggcctgcg gggagcgcct cagcctgggc 
ctcaacaggg cccagccggt tcagaacaaa ccgcaaaagg cctccgcccc cgccgcggac 
cctccgcggt acacctttgc accaagcgtc tccctcaaca agacggcccg gcccttgggg 
gcgcccccgc ccgctgacag cgccccgcag cagaatggac agccgctccg accgctggtc 
ccagatgcca gcaagcagcg gctgatggag aacacagagg actggcggcc gcggccgggg 
acaggccagt gccgttcctt tcgcatcctt gctcacctta caggcaccga gttcatgcaa 
gacccggatg aggagcacct gaagaaatca agccaggtgc ccaggacaga agccccagcc 
ccagcctcat ctacacccca ggagccctgg cctggcccta ccgcccccag ccctaccagc 
cgcccgccct gggctgtgga ccctgcgttt gccgagcgct atgcc 



<210> 22 
<211> 1689 
<212> DNA 
<213> Homo sapiens 

<400> 22 

cgacgcagag cagcgccctg gccgggccaa gcaggagccg gcatcatgga ttccttcaag 60
gtagtgctgg aggggccagc accttggggc ttccggctgc aagggggcaa ggacttcaat 120
gtgcccctct ccatttcccg gctcactcct gggggcaaag cggcgcaggc cggagtggcc 180
gtgggtgact gggtgctgag catcgatggc gagaatgcgg gtagcctcac acacatcgaa 240
gctcagaaca agatccgggc ctgcggggag cgcctcagcc tgggcctcag cagggcccag 300
ccggttcaga gcaaaccgca gaaggcctcc gcccccgccg cggaccctcc gcggtacacc 360
tttgcaccca gcgtctccct caacaagacg gcccggccct ttggggcgcc cccgcccgct 420
gacagcgccc cgcaacagaa tggacagccg ctccgaccgc tggtcccaga tgccagcaag 480
cagcggctga tggagaacac agaggactgg cggccgcggc cggggacagg ccagtcgcgt 540
tccttccgca tccttgccca cctcacaggc accgagttca tgcaagaccc ggatgaggag 600
cacctgaaga aatcaagcca ggtgcccagg acagaagccc cagccccagc ctcatctaca 660
ccccaggagc cctggcctgg ccctaccgcc cccagcccta ccagccgccc gccctgggct 720
gtggaccctg cgtttgccga gcgctatgcc ccggacaaaa cgagcacagt gctgacccgg 780
cacagccagc cggccacgcc cacgccgctg cagagccgca cctccattgt gcaggcagct 840
gccggagggg tgccaggagg gggcagcaac aacggcaaga ctcccgtgtg tcaccagtgc 900
cacaaggtca tccggggccg ctacctggtg gcgttgggcc acgcgtacca cccggaggag 960
tttgtgtgta gccagtgtgg gaaggtcctg gaagagggtg gcttctttga ggagaagggc 1020
gccatcttct gcccaccatg ctatgacgtg cgctatgcac ccagctgtgc caagtgcaag 1080
aagaagatta caggcgagat catgcacgcc ctgaagatga cctggcacgt gcactgcttt 1140
acctgtgctg cctgcaagac gcccatccgg aacagggcct tctacatgga ggagggcgtg 1200
ccctattgcg agcgagacta tgagaagatg tttggcacga aatgccatgg ctgtgacttc 1260
aagatcgacg ctggggaccg cttcctggag gccctgggct tcagctggca tgacacctgc 1320
ttcgtctgtg cgatatgtca gatcaacctg gaaggaaaga ccttctactc caagaaggac 1380
aggcctctct gcaagagcca tgccttctct catgtgtgag ccccttctgc ccacagctgc 1440
cgcggtggcc cctagcctga ggggcctgga gtcgtggccc tgcatttctg ggtagggctg 1500
gcaatggttg ccttaaccct ggctcctggc ccgagcctgg gctcccgggc ccctgcccac 1560
ccaccttatc ctcccacccc actccctcca ccaccacagc acaccggtgc tggccacacc 1620
agcccccttt cacctccagt gccacaataa acctgtaccc agctgaattc caaaaaatcc 1680
aaaaaaaaa 1689



<210> 23 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 23 

gcactgtgct cgttttgtcc gg 22 



<210> 24 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 24 

tccttgctca cctcacgggc a 21 



<210> 25 

<211> 30 

<212> DNA 

<213> Homo sapiens 

<400> 25 

tcctcatccg ggtcttgcat gaactcggtg 30



<210> 26 
<211> 28 
<212> DNA 
<213> Homo sapiens
<400> 26

gcccccgccc gctgacagcg ccccgcaa 28



<210> 27 
<211> 24 
<212> DNA 

<213> Homo sapiens 
<400> 27 

tccttgctca cctcacgggc accg 24 



<210> 28
<211> 22
<212> DNA
<213> Homo sapiens
<400> 28

gtaatacgac tcactatagg gc 22



<210> 29
<211> 23
<212> DNA
<213> Rattus norvegicus
<400> 29

gcggctgatg gagaatactg aag 23



<210> 30
<211> 23
<212> DNA
<213> Rattus norvegicus
<400> 30

atcttgtggc actggtggca tac 23



<210> 31
<211> 22
<212> DNA
<213> Rattus norvegicus
<400> 31

tgtgtcgggt cagcactgtg ct 22



<210> 32
<211> 1620
<212> DNA
<213> Homo sapiens
<400> 32

atggattcct tcaaggtagt gctggagggg ccagcacctt ggggcttccg gctgcaaggg 60
ggcaaggact tcaatgtgcc cctctccatt tcccggctca ctcctggggg caaagcggcg 120
caggccggag tggccgtggg tgactgggtg ctgagcatcg atggcgagaa tgcgggtagc 180
ctcacacaca tcgaagctca gaacaagatc cgggcctgcg gggagcgcct cagcctgggc 240
ctcagcaggg cccagccggt tcagagcaaa ccgcagaagg cctccgcccc cgccgcggac 300
cctccgcggt acacctttgc acccagcgtc tccctcaaca agacggcccg gccctttggg 360
gcgcccccgc ccgctgacag cgccccgcaa cagaatggac agccgctccg accgctggtc 420
ccagatgcca gcaagcagcg gctgatggag aacacagagg actggcggcc gcggccgggg 480
acaggccagt cgcgttcctt ccgcatcctt gcccacctca caggcaccga gttcatgcaa 540
gacccggatg aggagcacct gaagaaatca agccaggtgc ccaggacaga agccccagcc 600
ccagcctcat ctacacccca ggagccctgg cctggcccta ccgcccccag ccctaccagc 660
cgcccgccct gagctgtgga ccctgcgttt gccgagcgct atgccccgga caaaacgagc 720
acagtgctga cccggcacag ccagccggcc acgcccacgc cgctgcagag ccgcacctcc 780
attgtgcagg cagctgccgg aggggtgcca ggagggggca gcaacaacgg caagactccc 840
gtgtgtcacc agtgccacaa ggtcatccgg ggccgctacc tggtggcgtt gggccacgcg 900
taccacccgg aggagtttgt gtgtagccag tgtgggaagg tcctggaaga gggtggcttc 960
tttgaggaga agggcgccat cttctgccca ccatgctatg acgtgcgcta tgcacccagc 1020
tgtgccaagt gcaagaagaa gattacaggc gagatcatgc acgccctgaa gatgacctgg 1080
cacgtgcact gctttacctg tgctgcctgc aagacgccca tccggaacag ggccttctac 1140
atggaggagg gcgtgcccta ttgcgagcga gactatgaga agatgtttgg cacgaaatgc 1200
catggctgtg acttcaagat cgacgctggg gaccgcttcc tggaggccct gggcttcagc 1260
tggcatgaca cctgcttcgt ctgtgcgata tgtcagatca acctggaagg aaagaccttc 1320
tactccaaga aggacaggcc tctctgcaag agccatgcct tctctcatgt gtgagcccct 1380
tctgcccaca gctgccgcgg tggcccctag cctgaggggc ctggagtcgt ggccctgcat 1440
ttctgggtag ggctggcaat ggttgcctta accctggctc ctggcccgag cctgggctcc 1500
cgggcccctg cccacccacc ttatcctccc accccactcc ctccaccacc acagcacacc 1560
ggtgctggcc acaccagccc cctttcacct ccagtgccac aataaacctg tacccagctg 1620



<210> 33 

<211> 1665 

<212> DNA 

<213> Homo sapiens 

<400> 33 

cgacgcagag cagcgccctg gccgggccaa gcaggagccg gcatcatgga tcccttcaag 60
gtagtgctgg aggggccagc accttggggc ttccggctgc aagggggcaa ggacttcaat 120
gtgcccctct ccatttcccg gctcactcct gggggcaaag cggcgcaggc cggagtggcc 180
gtgggtgact gggtgctgag catcgatggc gagaatgcgg gtagcctcac acacatcgaa 240
gctcagaaca agatccgggc ctgcggggag cgcctcagcc tgggcctcag cagggcccag 300
ccggttcaga gcaaaccgca gaaggcctcc gcccccgccg cggaccctcc gcggtacacc 360
tttgcaccca gcgtctccct caacaagacg gcccggccct ttggggcgcc cccgcccgct 420
gacagcgccc cgcaacagaa tggacagccg ctccgaccgc tggtcccaga tgccagcaag 480
cagcggctga tggagaacac agaggactgg cggccgcggc cggggacagg ccagtcgcgt 540
tccttccgca tccttgccca cctcacaggc accgagttca tgcaagaccc ggatgaggag 600
cacctgaaga aatcaagcca ggtgcccagg acagaagccc cagccccagc ctcatctaca 660
ccccaggagc cctggcctgg ccctaccgcc cccagcccta ccagccgccc gccctgagct 720
gtggaccctg cgtttgccga gcgctatgcc ccggacaaaa cgagcacagt gctgacccgg 780
cacagccagc cggccacgcc cacgccgctg cagagccgca cctccattgt gcaggcagct 840
gccggagggg tgccaggagg gggcagcaac aacggcaaga ctcccgtgtg tcaccagtgc 900
cacaaggtca tccggggccg ctacctggtg gcgttgggcc acgcgtacca cccggaggag 960
tttgtgtgta gccagtgtgg gaaggtcctg gaagagggtg gcttctttga ggagaagggc 1020
gccatcttct gcccaccatg ctatgacgtg cgctatgcac ccagctgtgc caagtgcaag 1080
aagaagatta caggcgagat catgcacgcc ctgaagatga cctggcacgt gcactgcttt 1140
acctgtgctg cctgcaagac gcccatccgg aacagggcct tctacatgga ggagggcgtg 1200
ccctattgcg agcgagacta tgagaagatg tttggcacga aatgccatgg ctgtgacttc 1260
aagatcgacg ctggggaccg cttcctggag gccctgggct tcagctggca tgacacctgc 1320
ttcgtctgtg cgatatgtca gatcaacctg gaaggaaaga ccttctactc caagaaggac 1380
aggcctctct gcaagagcca tgccttctct catgtgtgag ccccttctgc ccacagctgc 1440
cgcggtggcc cctagcctga ggggcctgga gtcgtggccc tgcatttctg ggtagggctg 1500
gcaatggttg ccttaaccct ggctcctggc ccgagcctgg gctcccgggc ccctgcccac 1560
ccaccttatc ctcccacccc actccctcca ccaccacagc acaccggtgc tggccacacc 1620
agcccccttt cacctccagt gccacaataa acctgtaccc agctg 1665



<210> 34 

<211> 223 

<212> PRT 

<213> Homo sapiens 

<400> 34 

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

Leu Ser Arg Ala Gln Pro Val Gln Ser Lys Pro Gln Lys Ala Ser Ala
85 90 95

Pro Ala Ala Asp Pro Pro Arg Tyr Thr Phe Ala Pro Ser Val Ser Leu
100 105 110

Asn Lys Thr Ala Arg Pro Phe Gly Ala Pro Pro Pro Ala Asp Ser Ala
115 120 125

Pro Gln Gln Asn Gly Gln Pro Leu Arg Pro Leu Val Pro Asp Ala Ser
130 135 140

Lys Gln Arg Leu Met Glu Asn Thr Glu Asp Trp Arg Pro Arg Pro Gly
145 150 155 160

Thr Gly Gln Ser Arg Ser Phe Arg Ile Leu Ala His Leu Thr Gly Thr
165 170 175

Glu Phe Met Gln Asp Pro Asp Glu Glu His Leu Lys Lys Ser Ser Gln
180 185 190

Val Pro Arg Thr Glu Ala Pro Ala Pro Ala Ser Ser Thr Pro Gln Glu
195 200 205

Pro Trp Pro Gly Pro Thr Ala Pro Ser Pro Thr Ser Arg Pro Pro
210 215 220



<210> 35 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 35 

gagccggcat catggattcc 20 



<210> 36 
<211> 20 
<212> DNA 

<213> Homo sapiens 
<400> 36 

gctgcctgca caatggaggt 20 



<210> 37 

<211> 1456 

<212> DNA 

<213> Homo sapiens 



<400> 37 

cgacgcagag cagcgccctg gccgggccaa gcaggagccg gcatcatgga ttccttcaag 60
gtagtgctgg aggggccagc accttggggc ttccggctgc aagggggcaa ggacttcaat 120
gtgcccctct ccatttcccg gctcactcct gggggcaaag cggcgcaggc cggagtggcc 180
gtgggtgact gggtgctgag catcgatggc gagaatgcgg gtagcctcac acacatcgaa 240
gctcagaaca agatccgggc ctgcggggag cgcctcagcc tgggcctcag cagggcccag 300
ccggttcaga gcaaaccgca gaaggtgcag acccctgaca aacagccgct ccgaccgctg 360
gtcccagatg ccagcaagca gcggctgatg gagaacacag aggactggcg gccgcggccg 420
gggacaggcc agtcgcgttc cttccgcatc cttgcccacc tcacaggcac cgagttcatg 480
caagacccgg atgaggagca cctgaagaaa tcaagccagg tgcccaggac agaagcccca 540
gccccagcct catctacacc ccaggagccc tggcctggcc ctaccgcccc cagccctacc 600
agccgcccgc cctgggctgt ggaccctgcg tttgccgagc gctatgcccc ggacaaaacg 660
agcacagtgc tgacccggca cagccagccg gccacgccca cgccgctgca gagccgcacc 720
tccattgtgc aggcagctgc cggaggggtg ccaggagggg gcagcaacaa cggcaagact 780
cccgtgtgtc accagtgcca caaggtcatc cggggccgct acctggtggc gttgggccac 840
gcgtaccacc cggaggagtt tgtgtgtagc cagtgtggga aggtcctgga agagggtggc 900
ttctttgagg agaagggcgc catcttctgc ccaccatgct atgacgtgcg ctatgcaccc 960
agctgtgcca agtgcaagaa gaagattaca ggcgagatca tgcacgccct gaagatgacc 1020
tggcacgtgc actgctttac ctgtgctgcc tgcaagacgc ccatccggaa cagggccttc 1080
tacatggagg agggcgtgcc ctattgcgag cgagactatg agaagatgtt tggcacgaaa 1140
tgccatggct gtgacttcaa gatcgacgct ggggaccgct tcctggaggc cctgggcttc 1200
agctggcatg acacctgctt cgtctgtgcg atatgtcaga tcaacctgga aggaaagacc 1260
ttctactcca agaaggacag gcctctctgc aagagccatg ccttctctca tgtgtgagcc 1320
ccttctgccc acagctgccg cggtggcccc tagcctgagg ggcctggagt cgtggccctg 1380
catttctggg tagggctggc aatggttgcc ttaaccctgg ctcctggccc gagcctgggc 1440
tcccgggccc tgccca 1456



<210> 38 
<211> 423 
<212> PRT

<213> Homo sapiens 

<400> 38 

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

Leu Ser Arg Ala Gln Pro Val Gln Asn Lys Pro Gln Lys Val Gln Thr
85 90 95

Pro Asp Lys Gln Pro Leu Arg Pro Leu Val Pro Asp Ala Ser Lys Gln
100 105 110

Arg Leu Met Glu Asn Thr Glu Asp Trp Arg Pro Arg Pro Gly Thr Gly
115 120 125

Gln Ser Arg Ser Phe Arg Ile Leu Ala His Leu Thr Gly Thr Glu Phe
130 135 140

Met Gln Asp Pro Asp Glu Glu His Leu Lys Lys Ser Ser Gln Val Pro
145 150 155 160

Arg Thr Glu Ala Pro Ala Pro Ala Ser Ser Thr Pro Gln Glu Pro Trp
165 170 175

Pro Gly Pro Thr Ala Pro Ser Pro Thr Ser Arg Pro Pro Trp Ala Val
180 185 190

Asp Pro Ala Phe Ala Glu Arg Tyr Ala Pro Asp Lys Thr Ser Thr Val
195 200 205

Leu Thr Arg His Ser Gln Pro Ala Thr Pro Thr Pro Leu Gln Ser Arg
210 215 220

Thr Ser Ile Val Gln Ala Ala Ala Gly Gly Val Pro Gly Gly Gly Ser
225 230 235 240

Asn Asn Gly Lys Thr Pro Val Cys His Gln Cys His Gln Val Ile Arg
245 250 255

Ala Arg Tyr Leu Val Ala Leu Gly His Ala Tyr His Pro Glu Glu Phe
260 265 270

Val Cys Ser Gln Cys Gly Lys Val Leu Glu Glu Gly Gly Phe Phe Glu
275 280 285

Glu Lys Gly Ala Ile Phe Cys Pro Pro Cys Tyr Asp Val Arg Tyr Ala
290 295 300

Pro Ser Cys Ala Lys Cys Lys Lys Lys Ile Thr Gly Glu Ile Met His
305 310 315 320

Ala Leu Lys Met Thr Trp His Val Leu Cys Phe Thr Cys Ala Ala Cys
325 330 335

Lys Thr Pro Ile Arg Asn Arg Ala Phe Tyr Met Glu Glu Gly Val Pro
340 345 350

Tyr Cys Glu Arg Asp Tyr Glu Lys Met Phe Gly Thr Lys Cys Gln Trp
355 360 365

Cys Asp Phe Lys Ile Asp Ala Gly Asp Arg Phe Leu Glu Ala Leu Gly
370 375 380

Phe Ser Trp His Asp Thr Cys Phe Val Cys Ala Ile Cys Gln Ile Asn
385 390 395 400

Leu Glu Gly Lys Thr Phe Tyr Ser Lys Lys Asp Arg Pro Leu Cys Lys
405 410 415

Ser His Ala Phe Ser His Val
420



<210> 39 

<211> 1575 

<212> DNA 

<213> Homo sapiens 

<400> 39 

cgacgcagag cagcgccctg gccgggccaa gcaggagccg gcatcatgga ttccttcaag 60
gtagtgctgg aggggccagc accttggggc ttccggctgc aagggggcaa ggacttcaat 120
gtgcccctct ccatttcccg gctcactcct gggggcaaag cggcgcaggc cggagtggcc 180
gtgggtgact gggtgctgag catcgatggc gagaatgcgg gtagcctcac acacatcgaa 240
gctcagaaca agatccgggc ctgcggggag cgcctcagcc tgggcctcag cagggcccag 300
ccggttcaga gcaaaccgca gaaggcctcc gcccccgccg cggaccctcc gcggtacacc 360
tttgcaccca gcgtctccct caacaagacg gcccggccct ttggggcgcc cccgcccgct 420
gacagcgccc cgcaacagaa tgggtgcaga cccctgacaa acagccgctc cgaccgctgg 480
tcccagatgc cagcaagcag cggctgatgg agaacacaga ggactggcgg ccgcggccgg 540
ggacaggcca gtcgcgttcc ttccgcatcc ttgcccacct cacaggcacc gagttcatgc 600
aagacccgga tgaggagcac ctgaagaaat caagccaggt gcccaggaca gaagccccag 660
ccccagcctc atctacaccc caggagccct ggcctggccc taccgccccc agccctacca 720
gccgcccgcc ctgggctgtg gaccctgcgt ttgccgagcg ctatgccccg gacaaaacga 780
gcacagtgct gacccggcac agccagccgg ccacgcccac gccgctgcag agccgcacct 840
ccattgtgca ggcagctgcc ggaggggtgc caggaggggg cagcaacaac ggcaagactc 900
ccgtgtgtca ccagtgccac aaggtcatcc ggggccgcta cctggtggcg ttgggccacg 960
cgtaccaccc ggaggagttt gtgtgtagcc agtgtgggaa ggtcctggaa gagggtggct 1020
tctttgagga gaagggcgcc atcttctgcc caccatgcta tgacgtgcgc tatgcaccca 1080
gctgtgccaa gtgcaagaag aagattacag gcgagatcat gcacgccctg aagatgacct 1140
ggcacgtgca ctgctttacc tgtgctgcct gcaagacgcc catccggaac agggccttct 1200
acatggagga gggcgtgccc tattgcgagc gagactatga gaagatgttt ggcacgaaat 1260
gccatggctg tgacttcaag atcgacgctg gggaccgctt cctggaggcc ctgggcttca 1320
gctggcatga cacctgcttc gtctgtgcga tatgtcagat caacctggaa ggaaagacct 1380
tctactccaa gaaggacagg cctctctgca agagccatgc cttctctcat gtgtgagccc 1440
cttctgccca cagctgccgc ggtggcccct agcctgaggg gcctggagtc gtggccctgc 1500
atttctgggt agggctggca atggttgcct taaccctggc tcctggcccg agcctgggct 1560
cccgggccct gccca 1575



<210> 40 
<211> 153 
<212> PRT 

<213> Homo sapiens 
<400> 40 

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

Leu Ser Arg Ala Gln Pro Val Gln Ser Lys Pro Gln Lys Ala Ser Ala
85 90 95

Pro Ala Ala Asp Pro Pro Arg Tyr Thr Phe Ala Pro Ser Val Ser Leu
100 105 110

Asn Lys Thr Ala Arg Pro Phe Gly Ala Pro Pro Pro Ala Asp Ser Ala
115 120 125

Pro Gln Gln Asn Gly Cys Arg Pro Leu Thr Asn Ser Arg Ser Asp Arg
130 135 140

Trp Ser Gln Met Pro Ala Ser Ser Gly
145 150



<210> 41 

<211> 24740 

<212> DNA 

<213> Homo sapiens 

<400> 41 

nnnnnntgta ttttatcata ttttaaaaat caaaaaacaa aaggcagttg aggttaggca 60 
tggaggttcg tgcctgtaat cccagcactt tgggaagccg aagcacgtgg atcacctgag 120 
gtcaggagtt cgagaccagc ctgcccaata tggtaaaacc ctgtctctac taaaaataca 180 
aaaaattagc caggcatggt ggtgggcacc tgtaatccca gctacttggg agactgaggc 240
aggagaatca cttaaacccg ggaggcgggc tgggcgcggt ggctcatgcc tgtaatccca 300
gcactttggg aggccgagac aggcggatca tgaggtcagg agatcgagat catcctggct 360 
aacatggtga aaccccatct ctactaaaaa tacaaaaaaa attagccagg cctggtggcg 420 
ggcacctgta gtcccagcta cttgggaggc tgaggcagga gaatggcgtg aacctgggag 480 
gcggcgttgc agtgagccaa gatcgcgcca ctgcactcca gcctgggcga caagagtgag 540 
actccatctt aaagaaaaaa aacaaacccg ggaggcggaa attgcagtca gccgagatct 600 
cgccattgca ctcaagtatg ggtgacagag caagactcca tgtcaaaaaa aaaggcagtt 660 
gacaggagca aggagcctgg tgaggaagct gtggcatttg acccggctgt gttgctatgg 720 
gccagggtgg tgctagtaga ggagctgagt gggaaagagc acaggggaca tgctgaaggc 780 
ctgggtgtgg ggatgaggca gagattgggg gcaccttgca gggtcatagc aggtggctgt 840 
ggtgagatgg aggaagacac ctggggtact gctctaggct gtcagacata cagaagctgg 900 
cccagccaag cccaggggct gcaagggaca tccttttgtg tccccagtga tctgcagctc 960 
tcagacaccc tcaagcacag tgcctcttgc ccagcccagc actctcagtg gggagccagg 1020 
tgggagaaca ggctcggaag gggacctagg cttatgcagc gagccgggca aagctggaac 1080 
tggagcccag gcccctggat gccccctggc ttgtggagtt ctgggatact gaggggaggg 1140 
gacagggcat gggagtgcgg tgctctcacc tttgacttga actcattccc caggggacag 1200 
gggaggcctc ctcaggatcc acagatgccc agtctcccaa gaggggcctg gtccccatgg 1260 
aggaaaactc catctactcc tcctggcagg aaggtaagtt ggaggacgtg caagggcagc 1320 
ctcagccccc cacacccagg gctgggtctt tttgggactg acggagctgt cctggccacc 1380 
tgccacagtg ggcgagtttc ccgtggtggt gcsgaggact gaggccgcca cccgctgcca 1440
gctgaagggg ccggccctgc tggtgctggg cccagacgcc atccagctga gggaggccaa 1500 
ggcacccagg ccctctacag ctggccctac cacttcctgc gcaagttcgg ctccgacaag 1560 
gtgaggtgca ggggtgggaa agggtgaggg gctgacagcc tggaccctcc tgctaatccc 1620
cacccgtgtg ccctgtgccc agggcgtgtt ctcctttgag gccggccgtc gctgccactc 1680 
gggtgagggc ctctttgcct tcagcacccc ctgtgcccct gacctgtgca gggctgtggc 1740
cggggccatc gccgccagcg ggagcggctg ccagagctga ccaggcccca gccctgcccc 1800 
ctgccacggg ccacctctct gccctccctg gacacccccg gagagcttcg ggagatgcca 1860
ccaggacctg agccacccac gtccaggaaa atgcacctgg ccgagcccgg accccagagc 1920 
ctgccgctac tgctaggccc ggagcccaac gatctggcgt ccgggctcta cgcttcagtg 1980 
tgcaagcgtg ccagtgggcc cccaggcaat gagcacctct atgagaacct gtgtgtgctg 2040
gaggccagcc ccacgctgca cggtggggaa cctgagccgc acgagggccc cggcagccgc 2100 
agccccacaa ccagtcccat ctaccacaac ggccaggact tgagctggcc cggcccggcc 2160 
aacgacagta ccctggaggc ccagtaccgg cggctgctgg agctggatca ggtggagggc 2220 
acaggccgcc ctgaccctca ggcaggtttc aaggccaagc tggtgaccct gctgagtcgt 2280 
gagcggagga agggcccagc cccttgtgac cggccctgaa cgcccagcag agtggtggcc 2340 
agaggggaga ggtgctcccc ctgggacagg agggtgggct ggtgggcaaa cattgggccc 2400 
atgcagacac acgcctgtgt ccaccctggc ctgcaggaac aaggcaggcc gcctgtggag 2460 
gacctcagcc ctgccctgcc ctcctcatga atagtgtgca gactcacaga taataaagct 2520
cagagcagct cccggcaggg gcactcacgg cacacgcccc tgcccacgtt cattgcggcc 2580 
aacacaagca ccctgtgccg gttccagggg cacaggtgac ctgggcctta cctgccaccc 2640 
gtgggctcaa acccactgca gcagacagac gggatggaaa tcattaggac tccatgttgc 2700 
tctgcacggc cgagtgacac gaagaggcag gcggagggag ctgtgaggct tacttgtcag 2760 
actcaggaag gagcaacatg agggcccaac tggagacccg gaggcccgag ctgggaggag 2820 
gcagtggggg cggggtgcag gtggaaggga tttcagagac accctcgtcc aaaacacttg 2880 
ttccctgctg aaactccaac aatttgcaga tacttctggg aaccccaggc gtcagtctcc 2940
tcatctgtaa aggagagaga accgatgacg tatcaggcat aatccttgat gagagtttgc 3000 
tgcgtgccta ctcagtgcca ggcgctgggg gacacagccg tgttcaggac agccttggtc 3060 
ctgttctccg ggagccgaca ttccaggggg agagaagttt cctgaagact tccatgctgc 3120 
gttccctcct ctgctcctgc tcctggcgcc atcctaggag ccagccatgc acgcaagcgt 3180 
catgcctcca gggctctgac tgcccagccc ctcaccgcaa ctccacctca gctgcacaca 3240
cccttggcac atcctgaacc tcattttcat gacggacaca caatttttgc tctctcctgt 3300 
ccaagcctca tcctctggcc gccacctcct tccagctcac ttcctttagt gcggccagta 3360 
ccgcccctgc ctaggcatgt cgacctgcag ggaccctttt ctggctcttc gaggcctctg 3420 
cccaccatcc cctctttgtt ctccatagtc ccttccccct gttctctctc gtttcatctt 3480 
actggtctgg caaagtcccc ggccttgggc gagccagacc tcctcagtgc ctgcacacag 3540
ctgcccacag ccagagaaat ccatttaagc agactgcctg catccttctt aacagtgcaa 3600 
ggcaggcact ccctgccaca agagaccctg ttccctagta gggcagcttt tctcctcccc 3660
agaacctcct gtctatcccc acccaatgtc tcctcacagg catattgggg aaacaggtca 3720
ggctctccca ccgtatctgc aagtgtactg gcatccatct gtcttcttcc tacccctaca 3780
gtagaaacag tgtctgtccc cagctgtgct ctgatcccgg ctcctttcac ctcagagctt 3840
ggaaaattga gctgtcccca ctctctcctg cgcccattca tcctaccagc agcttttcca 3900
gccacacgca aacatgctct gtaatttcac attttaaacc ttcccttgac ctcacattcc 3960
tcttcggcca cctctgtttc tctgttcctc ttcacagcaa aaactgttca aaagagttgt 4020
tgattacttt catttccact ttctcacccc cattctctcc tcaattaact ctccttcatc 4080
cccatgatgc cattatgtgg cttttattag agtcaccaac cttattctcc aaaacaaaag 4140
caacaaggac tttgacttct cagcagcact cagctctggt tcttgaaaca cccccgttac 4200
ttgctattcc tcctacctca taacaatctc cttcccagcc tctactgctg ccttctctga 4260
gttcttccca gggtcctagg ctcagatgta gtgtagctca accctgctac acaaagaatc 4320
tcctgaaagc ctgtaaaaat gtccatgcat gttctgtgag tgatctacca agaaaataaa 4380
aaattttaaa aatcaaatgc ccatgcctgg gcccacacgc aggggctctg atttcatcag 4440
tctggtaggt gggttctggg catccacgct cactggattt ccggatgatt gtagtatgca 4500
gcctaggctg ggaaccactg gcctcagcaa gccagtcatt ctccaggtgt cacagaccct 4560
ctaggtgcta atgaccccga aggtctgtct tcagtgcaca cctccccctg agctccagat 4620
ttaggaatcc cactgcacac gagacatctg gatgtggaaa agacatctcc agatcccatg 4680
ggtgaaaggg ggttggggga atggagactc gtgttcttcc aggatgtgtg tggacacaga 4740
atgcaaagcc tggagggatg ctagagccat agggaggaag atttcggctc acttattcat 4800
gcaagcactt cctgatgggt aaggtcttag agcaagctga ggccaagagg cgggcagtcg 4860
aggtgctgct gcaggcaccc ccactcccta cagtggcaag cccaagccca gcccttggca 4920
get ca aa t c c~~c iaggacaege^ tga agg t ca c ccagagagtc aggggcatgg ctagaaccag 4980
aacccaggac tctggggacc cagcatggca tcctttcctt cattacaaat ctgagctgct 5040
ttgtttccta gggatttctg tgatattcca aggggactgt gggaaagaaa gtccttggaa 5100
accaccagga cgctagaggc ctggcctgga gcctcaggag tctcggccac cagagggcgc 5160
tgggtccttg tccaggtcca gttgctacgc aggggctgcc tgtgctggga ggctccccag 5220
gggacacaga ccagagcctt gcaccagccc aaggaatggg agcctggggt cctctctgct 5280
ggaggactgc caggaccccc aggctgccgc ctcttccttt gctcatttgc tgtttcactt 5340
tgtcaatcct tcctttcttc gtgtgttcat tcacatccac tgtgtgctgg ccctggggaa 5400
atgttagata agacacatta gctgtgtgtc ttcattgtcc taacaaagaa cacaccctgg 5460
aaagagcacc gcagagagtc cccattcccc catctccctc cacacatgga atctggagat 5520
gccttttcca catccagatg tctctggtgc tgtgggattc ttaaataaac aaacatttca 5580
tacagaatgt gagatgatgg agatgctatg gggaaaagta aagcagaggg agggcctagt 5640
gtgtgatgcg ggtgaggcat ccagggattg ctgtttcagc tgtgatcagg aaaggccctg 5700
ggaggaggcc acatctgagc agagacctaa ataaagttgg aaacctgttg ctgagatatc 5760
tggagaagtg tttcaagggc cgggcaccgg gcatggtggc tcacgcctgt aatcccagca 5820
ctttgggagg ccaaggcagg tggatcgctg gaggtcagga gtttgagagc agcctgacca 5880
acatggagaa accccatctc tactaaacat ataaaaatta tccgggcatg gtggttcatg 5940
cctgtagtcc cagctactcg ggaggttgag gcaggagaat cacttgaacg tgggaggcag 6000
aggttgcagc aagccgagat cacaccactg cactccagcc tggatgacag agcgagactc 6060
cgtctcaaaa aaaaaaaaga aaagaaaaaa gaaaaaaaaa gaaaagtgtt tcaagcaggg 6120
gaactggcaa gtggagaggc cctgaggcag aaatatgctt ggcctgctgg aggaaatgtg 6180
agtgaggagg tcagggtggc tggagtggag ggagcgagtg gtaggagtca gacccagttt 6240
attcatattc tgtaggtctt aaggacttca gttttatttt gagtgcaata tgagcccact 6300
ggaatgctaa aagctgagag tgacatggtg ctgtgattct ggctttaaaa atatcacttt 6360
ggctgcttcg tgaagactct ggaaggggca agggtgaaag cagggatgcc cgttaggaga 6420
ccgttacagg ggcgcaggca caaaatggca gtggctggga caatggtggc agcagcggtt 6480
agatgtgaac atgttgaagg tggaatttgc agaatctggg ggaggacaga agagaaagga 6540
taacttcatc gtttctgctg aaccagttgg ataaatgttg gtggcacttc ttgaagtgag 6600
gaaggagtta ggaaggtggg aaaggcacaa gtttgaattg ggccatgatg gtctgagata 6660
cctagtacag tggttcccca acctttttgg cagaagggac cgctttcatg gaagacaatt 6720
tttccacaga ctgggggtgg ggtggggatg gtttcagggt ggttcgagtg cagtacattt 6780
atcattagac tctttttttt tttttttttt tgagatagag tctcgctctg tcacccacac 6840
tggagtgcag tggagccatc ttggctcact acaacctctg ctgcccaggt tcaagtcatt 6900
ctcctgcctc agcctctcaa gtagctggga ttataggcat atgcgccacc acgcccagct 6960
aatttttgta tttttagtag agacggggtt tcaccatatt ggccaggatg gtctcgaact 7020
cctgacctca agtgatcctc ccccgcctca acctcccaaa gtgctggggt tacaggcgtg 7080 
aaccactgca cccggcccat ttatcattag attctcataa ggaatgagca acctagatcc 7140 
ctcgcatgca cagttcacaa tagggttcac gctcctatgg gagtctaatg ctgccgctgc 7200 
actcagcttc tctggcttgc cgctgctcac cttctgctgt gcagcccagt tcctaacagg 7260
ccacaaacgg ggagttgggg acccctgatc tagtaaacat ctaggcaggg ttttggataa 7320
tggagttaga gttcctgggg agaggtcagg ctggccatga aacatgggat gcctttgcat 7380 
ataggtggtg ttgaaagcca caggacagta cggggtctca gggggtgagc ataaagagag 7440
gcgacatcag atggccaagg ccagaggcag aggaggatgg gaaggagggg ccagtggggc 7500 
agggggaagc tgtgaagcca gggaaaaagg gtgtttcgcg gaaaaggatc aacctggacc 7560
agtgctgccc ctaggcaggg caggatgaaa cttaaccacc acggattcca tggccccatg 7620 
gcctccaggc cacaggggac cttgagaaga gagatctcag gggacgggtg cggacaagag 7680 
cccgcctggc atggcttcaa gagataactg aaggaaagca agtggagacg cgataaacag 7740 
acaactccct ggaggaattt tactctcgag aggagaatta aagggtagta gctggagagg 7800 
gatgtggggt caagagaagg tctttaacga cgagaactct cacggcggtt tgtgcagaac 7860 
agggtgggtg tgatgactgt ggatggagag gggagaactg cagcgactct gtcctaggag 7920 
gaggtgatgg gccgggacca ccaagcgagt ggagggtgga cgccccttcc ctcaccccga 7980
cacccgcatg tgctcagtgt ccgtgccgcc ggccctagtg cctgggctga acgcggggcc 8040 
gggactctga ggacgcctcc caggcgcgca gtccgtctgg ccaaggtgga gcgggacggc 8100 
ngcttccgac ggtgcgcggg tcggctcggg gttgcaggga catccggcgt ccgctcctgc 8160 
cctgttttcc tgccttcgca gagcgttgcg caactctagc tttaaacgcc cctgtccccc 8220 
tcaacttgtc tcccccagcc cctctgattt acagattctg cagtccccga gggttgcgcc 8280 
tacgataccg acactcgcgg cagcctgcga ggcgagtatg atcgtcccat ttttcggagt 8340 
agcaaactaa ggttcagaga ctactatgtc ccaggtcggt ctggtttgaa ggtccgcttt 8400 
cctctccctc cgccagcggg cggtgcgagg gactgggcga ggcagcgctt ccctaaggag 8460 
gcgacccgca gccccggccc cctcccgact ccgccccgtt gcagggcccg ggtcggcgag 8520 
gcctctcagc tctaagcccg acgggacttg gtgattgggc aggacggaag agctgggtgg 8580 
ggctttccac cagcggagaa agtctagtgg gcgtggtcgc gacgagggcg tggcctggtg 8640 
ccccgccccc gtccgcgcgc tcaaagtgga gggtggctgt gggggcgggg tcagaacact 8700 
ggcggccgat cccaacgagg ctccctggag cccgacgcag agcagcgccc tggccgggcc 8760 
aagcaggtat cgacgaccgc gcggggcgtc ttgggctgga ccaggcgggc gcccggggcc 8820 
tgctgaggac cacaaagggc actgggggtc gtggtccagg ctgtgcttcc tcccgctggc 8880 
cctggcccct gcctccgccc ccgcccccgc cttcctgccg ctaagccggc tgcggcgggg 8940 
ccgattggcg cctgccggct tcctgcgccg gggccagtct aatgcatggg gcccgggcgg 9000 
gggactaagg ggaaactgag tcacgtcggt gtgggagcag ttctgtgtgg gaggcaccac 9060 
cccccactgg gctcggggaa ggatccccct ccaagctatg cttgagggtc ccagccccca 9120 
tctgtctcca caggggccgc accccactcc cgccttcccc ttcttcagca cccaggggtc 9180 
ccgccctggc tcccagcagc ctcgactggt cccggaatgg ctaggaggat ccgctgcagc 9240 
cgcctccctc ccctcccctc ccctcccctc ccctcccctc ccctcccctc ccctcccctc 9300 
cccctcgcgt cccaagcccc cgtgtgctcc ctccgctggc tctccgcaca gtgtcagctt 9360 
acacgcctta tatagtccga gcaggctcca gccgcggcct gctgccggga cctgggggcg 9420 
ggggagagga gagccggccc ctgactcacc cggaccgccc gaggctccag gctggcttgg 9480 
ggggaggccg cgccagttta gtccctcggc ccacccctgg ttgcaaagaa cctcaagcct 9540 
ggattcaggc acccctcacc gttccagtcc caaggggagg ggggctgctc ctgtctttcc 9600 
aaagtgaggt ccgccagcca gcagcccagg ccagcctgac aaaatacctg cctcctatgg 9660 
cttgggcgtg ctcaggggct gcccgtgcct gcctggcccc tgtccaaggc tggtatcctg 9720 
agctggcccg gcctgcctgc ctgcccgccc accatgctgg ccactcacct tctcttctct 9780 
cctctcagga gccggcatca tggattcctt caaagtagtg ctggaggggc cagcaccttg 9840 
gggcttccgg ctgcaagggg gcaaggactt caatgtgccc ctctccattt cccgggtgag 9900 
cctaggtttg gggagggggc tcccccagcg gtctttcggt gcttaggtct ccagagggtg 9960 
atggggggag tcctaacagg agctggtcag gggccagcag gccaggagat gtctaggtcc 10020 
ggagatgtag tggtacctgc ctgccacaag gactcccaat gaggtggata ctgggaggga 10080 
gcacccaggc ttctccagcc ctgcactgta cccgatgctg ttctcccaag ctcctgtggc 10140 
cacctctgag ggctggaggg aggctcattg tgcaggatgg gagcctaaca tttcaggagg 10200 
tatctaaact tgaggtggca atgcttggag ccaggcccca ggcaggacac tgtgactata 10260 
ggatttcact tcagcctcac tgccgcccag ggaatagcaa tcctcatccc gtttttccag 10320 
atgagagaag aactcatgga gaggtggcgg ggctcgctca tcgagtccat ggtgaagcag 10380
ggattggaat tgaggcacag catggcgtac attttttgtg ggtagaaggg gtctctcccc 10440
agcctatgta aggacccaca tccactgttc ccattcagga tgtggtggcc tttgacccca 10500 
agcagaagtg taggacaggg ctccattcta ggggcttaac ttcagcttcc aagagcctgc 10560 
cctggtgtgg gtggagctgg aggctggctc ctccctgtag cagggggatt gccttataag 10620 
cccaagaatg cagccccacg ctgggatggc caacagtggc tgcggtctgc agagctgaaa 10680 
agggctggcc taggcctggc cccctgaacc ccactggtgg gcctctcagc tggtcaccag 10740
gctgcagctc cagctgtatg gtccagttgt gagacacaac aaattgcctg cccagagtgg 10800 
gtgaggccag cctgtcggct ggcatctctg actggcctgg gggtcaggag ggggtgggga 10860 
cttcctgccc ctatatccgc ctgccccgag agacccaccc aggcgccggg tgggcaggca 10920 
gctgttgtca ggaagcccaa ggcaagccca gcctggaggg gcccagaggg tcgtggcctg 10980 
aggaggggct caagctggag tctgtctgta ggagctgggc gtgggggtta gggtgggcag 11040 
gccagcagtg ctcttctcag gggtcctttg atggcattct cctggaacct gccccgccag 11100 
cagggtagtg aggcagtggt tgccctatga cacacgtccc actacatagc cctcacacag 11160 
ccctgaaacc tacctgacgt cctgctccct gggaaagtgc tggcccagtg tgtctgggga 11220 
gcctgaacct cagtttcttc cctgatggag atgactttca gatatggcct gttgggggca 11280 
ctccgggctc cagctccctg gtcagcatcc ctggcatgtg ggcggggcca ctagctgatc 11340
ccagccctgg agttggacct gggcccacat gggtgggtga ggtgggcttt tctgagttag 11400 
gccagccccc tccccctccc ctgaccccag aatggaggga ggtgggaggg gcaagggctg 11460 
gctgtgggcc caggcctggg agatgaggta acgtctggga ctggggggct gggctgctca 11520 
ggctgactca cccccacctc atgcagggtc cagccccctg gctttttccc tccttggttc 11580 
ctctggcctt accctgcccc tggcttgagc ccctccctgc ctctctccag ccacccgccc 11640 
agcgctgtct tctgctctcc tgctgccctc cccacgctct gaacacccct catcctctgt 11700
gcttcctgcc ctcctcactc tgggaaggga agccgtcccc gccccccacc ccctctccag 11760 
gagccagcta gctgcacccc aagaccccca cctcgggctc agcccacagc tcccaggagc 11820
cagccctgtg ggcagggagt ggctgggcca ggtttccctt ctactgactc accatgacct 11880 
tgagtaagtc acttcccctc tggggtgtca cttccccata cacagtataa ggggttgatt 11940
tagttggatt gaactaaagg tgagggagtg gctcagggtg tctccaggtg ggctgacccc 12000 
tcagttgggc ccccatgctc agcagaggtg gcccacagtg gtggagcctt agggtcagag 12060 
acacttcctg gctctgcctc ttactagctg ggtgacttga ggcaagttgt ttaacctctc 12120 
tgtgtacatt tgcaagtgca aaatgggtaa aatcccagat tactccacaa ggttgttgga 12180 
agattcagtg tcaatatgta gcatagttgg tgctcaataa actgaagcaa gtcttcttat 12240 
ttagcgagtg aggaaggggc cgccgagctc tcttagcctt ctgacctcct acgcaagcaa 12300
gaggtcatgt tgagcccagc tcgcctttct tttcccagtg ctgtcaagct ctgtgcctgg 12360 
ctgccctgcc ctctgacatc tctctgaaac ctcttgcctc ccctctccct gcctcagctc 12420 
agtctgtgca ctgacccacc tgaggagcct cctggggcca ctggcagcct ggaccccccc 12480 
agatcccccc cacccagtga aattgtcttc cagcactgcc tcacaaaagc ctacttgatg 12540
cagtgcca^g cctcttgcca gatggctggg tggtccctta ggcttggacc cagtcaagct 12600 
gccctgcctg tgttgctggg gctgggctag aggcctggaa ggggtttatc agggtcaccc 12660 
tctcagggcc tgggagatac ccaatcccag acattaaaac tgccagtagc ccctctacct 12720 
tcaaagccaa gtcctggtcc cttcccctgg cattcaaagc catcgtaagt gaactctcac 12780 
ccgctaggca gcacacgcca ttctccttta ccgaggccca ccgcttcctc aaagtcattc 12840 
ctgatggtct cagctcatgc tggtggcagc catttctccc agcctactgt ctctactcat 12900 
tgccacagga accagggact cccagctcaa gagcctgaag gattggggtc aggggaaatt 12960 
ggcagtcgag ggcttgggag tgacagccat gtatggccta cgaagtccca gctgtcaact 13020 
taggtcccat tcaggcagtg ttcacaggga accgggagat aacagggcct gttcctggct 13080 
ctcaaagggt cccagcagac ccctatagat ggcccccgac agggtgctgg ggggtgagag 13140
gtccataaga gcccccggtg gtttcgggga ggaagctgcc ccctgcatgg gccagagggc 13200 
atatctggta ggtggagtgg cctgggcagg aggccagcag gagcctcaaa aggcaatggt 13260 
cctcctgaaa cacttgggct ttagcctgag cgtggctgtt tgtggacatc atagcaattt 13320 
ctggactgtg ggggagggtg gtggcggtga atagataagc atcgtgactg gggaagctca 13380 
ggtgagcacc acctgaggga gagggtctgg cagtgaataa ataagcagtg tgactgggaa 13440
attgtgaagc tcaggtgagc gccaccacct cctgggttgc tttagtgtcc agcagctgcc 13500 
tagaactatg ttgaatgaag agctctctgg gttctggaag tgggacagct ttgggtgggg 13560 
cagtgttacc accgtcagcc tggcttgggt ctgcagggtc cagggcctcg gtcactttgc 13620 
ttctctctcc acagctcact cctgggggca aagcggcgca ggccggagtg gccgtgggtg 13680 
actgggtgct gagcatcgat ggcgagaatg cgggtagcct cacacacatc gaagctcaga 13740
acaagatccg ggcctgcggg gagcgcctca gcctgggcct cagcaggtat gcgggtggac 13800 
atggatgggt gcgcccgcgc tggcagtggg gatccctgcg gcccggcccg ctgtcacgct 13860 
ttccttctcc tccagggccc agccggttca gagcaaaccg cagaaggtac gaggctggcc 13920 
gggacatccg ggcggtgggc ggtgtgggct tggacggcca ggcctgctcg ccctcctggc 13980 
acattctcgg taccccaatc cctggccggg agtggagggc agaaaccgga gctaaggcgg 14040
gtctagggcc ctggagttga gccaggggct gctgcacggt cctggcacca cgcatgtccg 14100 
cctgtctgtc cgcctgtctg tccgcctgct gcctcccgcc gccggcgctg cgtgctcgcc 14160 
cgcactcggt cagccctcgg tcctgcgtgg actgagatcg ccactcccaa atgggcccct 14220 
tgaaacctga gtcgtcctct ccccgtagcc tccaaataga tgtagggggt ggggtggggg 14280 
tggggggctg gagctgccgc tgtcctctgc tgcaggcgcc ccacttccac ccaggccccc 14340
accttaccct gcccgcccgc cctgcccggc tgtgtctctg cccaggcctc cgcccccgcc 14400
gcggaccctc cgcggtacac ctttgcaccc agcgtctccc tcaacaagac ggcccggcct 14460 
ttgggcgccc ccgcccgctg acagcgcccc gcagcagaat gggtacgtcg gcccctgccc 14520 
gcccgcgccc acgccatcag gcccactgtg gccccacgcc cgctgcccgc tgctgctcag 14580 
tctgtgctgc gccccagccc ggcggaaccg tgcggcacgc cccctggcgg ccggggtggg 14640 
gctgcaggca cagggcccct cccgaggctg tggcgccttg cagggcaccg cctggggagg 14700 
ggtctctgaa tgacgccgcg ccccctgctg gcggctgggg gttgggttgt ggtgtcgggc 14760 
cagctgagcc ccagacactc agtgccgcct tgtccccggc tgttctgacc cctccccgtc 14820
tttcttcctc tcctgtgtct gtccctttgt ccctttatct gtctgtctgt cttatttcct 14880
tcacaggtgc agacccctga caagtcagtg agcccccctc tgcctgtgcc tttcttcttc 14940
cttttggcac tctgggtggc ggcccctccc caccctggct gccctcctct ccacttcgcc 15000 
1: : t c c tg tc c t" c t ca cc t a c c cgcccagcag g^ctcdt^gc ^t^a^c^tta^cccactccct 15060 
cccatcactg taacccaaac ccacatgcac caaatcctgg gaggggctgc ccccaccgcc 15120 
cacccccagt gtggggttct gagccacacc ctccccacag acagccgctc cgaccgctgg 15180 
tcccagatgc cagcaagcag cggctgatgg agaacacaga ggactggcgg ccgcggccgg 15240
ggacaggcca gtcgcgttcc ttccgcatcc ttgcccacct cacaggcacc gagttcagta 15300 
agtgccagcc cagggcaggg ggtactttcc tcgcccccag cccaggcgtg atccctgacc 15360 
ctgtgtcttt tttggtcaat gcctgcctct gctctctcag tgcaagaccc ggatgaggag 15420 
cacctgaaga aatcaaggta cagggacggg caccagcccc tctcccacct cctgcctctt 15480
ccattccagc tactgccctg tgtctactcc tgaggctccc agctggggct ctcaattctc 15540 
ccttccttcc ttccttcctt ccttccttcc ttccttcctt ccttccttcc ttccttcctt 15600 
cccttcctcc ttccttcctt ctttcatttc ttccctccct ccttccttcc ctcctccctc 15660 
cctgcctccc ttccatctct ccttccttcc acttcttcct ccctctctct ctgcccctca 15720 
gggaaaagta tgtcctggag ctgcagagcc cacgctacac ccgcctccgg gactggcacc 15780 
accagcgctc tgcccacgtg ctcaacgtgc agtcgtagcc cggccctctc cagccggctg 15840 
ccctctctgc ctccctcttt ctgttcctcc tgcccagggc acccccttag tgcctccagc 15900 
ttctgcctac ctcacccccc ctttcgtgcc cctggcctga gcctcctgct ggcctggccc 15960 
tggccgccca cctgggttca tctgacactg ccttccctct ttgccctgtg gtactgctgt 16020 
ctgccaggtc tgtgctgcct tgggcatgga ataaacattc tcagccctgc ttgctctgcc 16080 
tgtcttctat ctttgtggac ctggtttgca tttggggtgt gggggtgttt cgtggttcgg 16140 
actgtttggg ccctgccgtc cttgttttca gtgggagggg gtacctggca aaggggccct 16200 
gccctgccat cacagatggc ttcctggcat gaggggagcc ccaggagctg cctcagaagc 16260 
gggagccctg cctcgtctcc cagctagaga ccgcacacca gctaactgga cattgctagg 16320 
agaagctgcc cttcccatcc ctaccccagt gggacctgga atccaactcg gcagtttcca 16380 
cgcccccagt catctcccgt ggggccagca ggacccaggt tggggggtgg ggccatgtca 16440 
ggaagctcag ccatgcaggg ccttgaatgg cagatcttgc agccaggtgc ccaggacaga 16500 
agccccagcc ccagcctcat ctacacccca ggagccctgg cctggtgaga gggagtgggc 16560 
tcgggcctgg gcaagggtgg gcagcctcca ggggcatggg ggtggtgggc ttctctcagc 16620 
tgcctggggc tccacccccg tcctttgggg tccctgggca cccctttaga gtcactttcc 16680 
ccggcaggcc ctaccgcccc cagccctacc agccgcccgc cctgggctgt ggaccctgcg 16740 
tttgccgagc gctatgcccc ggacaaaacg agcacagtgc tgacccggca cagccagccg 16800 
gccacgccca cgccgctgca gagccgcacc tccattgtgc aggcagctgc cggaggggtg 16860 
ccaggagggg gcagcaacaa cggcaagact cccgtgtgtc accagtgcca caaggtcatc 16920 
cggtgggtgg cctgttcctg tccgaccctg gctttcccat cctgcagccc agccccacct 16980 
gtctgcccac ctgtcttgcc tcagctgcga ctggggggaa taaggattca gttctcagct 17040 
ggagtaggag tagggacctg ggctgggtcc tcccattctt aatcccacgc tacctacccc 17100
agcccaccca caacaactgc tagcagcatc tgccgtggcg aaatagccga agggccaacc 17160 
ataggctgaa gctgcacccc tacctttgct gctctctggg caaagagggg cctgccccct 17220 
cccagcgcgt ctgcccctcc ctcctgctct ctgtctccct ctgctctcag agcatacagg 17280 
cctggagcca ctccctctgt gcactgcccc gtggggccaa gcagcatcaa acacccccca 17340
gcatcagcgt gccggattct agagccttcc taattcgcag gcctggcctg ctctcatctc 17400 
tgtcagctct tttttttttt tttttgaaac agagtctcac tgtgttgccc acgttggcgt 17460 
gcagtggcgc gatctcggct cactgcaacc tctgcctcct gggttcaaga gattctcctg 17520 
cctcagcctc ctgagtagct gggattacag gcacccgcca ccatgcctgg ctaattttgt 17580 
atttttagta gagacggggt tttaccatgt tggccaggct ggtctcaaac tcctcacctc 17640
aggtgatctc aggcctgcct tggcctccca aagtgctggg actacaggtg tgagccactg 17700 
tgcccagccg actctatcag ctcttgccag gtagaacagg caggccagca ggacagggca 17760 
gctccagggt ttgcccaggg gcggctcagc ttttatgagrg ctccagtcgt cagcccttcc 17820 
tcccggggtc ctccctgctc taaagctgcc tctcctgtca ccagcagttc agtgtggcgg 17880 
actggctctg taagcttcat ggctgccacg gtcacttccc aagcctgtct tctatcctat 17940
gtggaaaatg gggagaatga actgtccctc ccaaggcctc ctggtgggtg gtcagtcaac 18000
ctgaaggggg ccaagacccc cacctctctg cgtgtgctcc ctctgaccgc tctcgcctcc 18060
ctgcaggggc cgctacctgg tggcgctggg ccacgcgtac cacccggagg agtttgtgtg 18120 
tagccagtgt gggaaggtcc tggaagaggg tggcttcttt gaggagaagg gcgccatctt 18180 
ctgcccacca tgctatgacg tgcgctatgc acccagctgt gccaagtgca agaagaagat 18240
tacaggcgtg agtagggctg gctggcgggg aggtggtccc aagcctgtca gtgggaacga 18300 
gggctgctgg gaaacccaca gtccaggtct ctccccgagt gagcctccgg gtccttacca 18360 
gcgtaataaa tgggctgctg tactggcctc accctgcatt agtcaggatg ctcttaacaa 18420
atgaccatgt tcctgctcag aaaccgccca aggctgcaaa gagcaggagg accaagccag 18480 
gagaagccct gggccctcct gactcccact ttgggctctc cctgccctgg tgaaatgaca 18540 
gaacggccaa cttgacacgc tgaagctgct ctgtctcatg cgtcctcctc atttctggat 18600 
ccagagccag ggctgccagg agtagccaga gagctctgtg tggtgatgtt catattagtg 18660 
aggtttacct tgaccacgag cagtgggaaa ctcaaaataa tggtggctta tttctcatct 18720 
aaaaacatcc cggggtgggt ggtctgggac tgatctggtg gacccaggct ccgccttgtt 18780 
gcttgactgt tggcagcacc tgcttactta ccactcatgg tgcaagatga cacttcagcc 18840 
tccgccaaaa tgctcacctt ccagccagca ggaagtcgga aggagaagaa aggggacaga 18900
gccccatggc gtccatcctt agaggatgct gccacctgaa cctctgcttt catcctgttg 18960 
gtcagaaccc agtcacatga ccacacccag tggcaacgga ggctgggaaa tatagtcttt 19020 
attttgggca cccatgtgtc cagcaaaact gggggttcca tcagtcggca agaacgggag 19080 
agtggccgat gcagtggctg atgcttgtat cccagcactt tgggaggtcg aggtgggcag 19140 
atcacctgag gtcaggagtt caagaccagc ctggccaata tggtgaaacc ctgtctctac 19200 
taaaaataaa aaaattagct gggtgtgctg gcgcacctgt agtcccagct acttgggagg 19260 
ctgaggcagg agaatcgctt gatcttgaga ggtggaggtt gcagtgagcc aagattgtgc 19320 
cactgccttc cagcctggga gacagcaaaa aaaaaaaaaa aaaaaaaaaa aaaaagggcc 19380 
aggcacggtg gctcacacct gtaatcccag cactttggga ggccgagatg ggcggatcac 19440
gaggtcagga gattgagacc atcctggcta acacggtgaa accccatctc tactaaaaat 19500 
acaaaaaaat tggccgggca tggtggagta gtcccagcta ctcgggaggc tgaggcagga 19560 
gaatggcgtg aacctgggag gcagagcttg cagtgagccg agatcgcgcc actgcactcc 19620 
agcctgggca acagagcgag actcttgtct caaaaagaaa aaaagaaaga gaaatctgcc 19680 
tcccagcctt gggctcctgc cctaccagcc cacacccctg gtagagcctc ctctcccacc 19740
agctcaaagc ccaagttcct tcactgtgac cttgtctgct cctctaaaac aggcaacacc 19800 
agacagtgag aagagccagc cagacatggg cagaaaacct atttctgtga tctactggct 19860 
gtgtgagcag gggctagttg ctctctctgg gcctcactga agagaagggt ggcactatgc 19920 
tagggccggc acggttgcaa ggtagatgta agatggggta caggtgttgt ggagggcaga 19980 
aatgcaccat ccgaaggcta catgtccccc acacttatgt cttgcttggc ccacactgtt 20040 
tcattttaaa atcagtagca aacaatttaa aaaatcagaa gatttgcctg catgatgcag 20100 
tggctcatgc ctgtaatccc agcactttgg gaggccaagg tgggaggatt gcttgagccc 20160 
aggagttcaa gaccagcatg ggcaccatag caagacccct gtttctacaa aaaaaaaaaa 20220 
attagaaaat tagccaagtg tggtggcatg cacctgtggt cccagctact tgggaggcag 20280 
agggaaagtg agatctcctg ctttttattt ctttatgtat aatgataggg tcttgctctg 20340
ttgcccaggc tggagtgcag tggcatgatc actgctcact gcagccttga tctcctgggc 20400 
tcagaggatc ctcccacctc agcctcccaa atagctagga ctagaggtgc ccaccagcat 20460
gctcagcaga tttttaaatc tttttgtaga gatgaggttt tgctatgttg cccaggctgg 20520 
tctcgaactc ctggcctcga gcgatcctcc caccttggcc tcccaaagca ctgggattac 20580 
agacgtgagc cactgcgccc agcagatttc tctttaacac ctagatttca gcctgagcca 20640 
ggcaggcatt cctgaatgaa ccagtagtac tgctcccaga agaagaggtc ctcctccgtg 20700 
tgacacagtc cccacttggc ccttgcaggg attggatctg ggatccctgg atttaaactc 20760 
agggccatcc tcataacagc ctcacaaggc tgggattagc ttcccagttc acaagggaag 20820 
aaaccaagac ttgagaaggt caaggtctgg ccagacccac acatcttgga ccctcatacc 20880 
gcctcgaggc cccatgctgc cctctgcctg ctccagatgt gaatactgct ggccctggct 20940 
ggccccggct ggccccgagg gtcctaggga tgaacagccc agcccaggga gagctcagcc 21000 
ccttgtgcct ctgccccttc ccacctcctg cggaggccag tcgactcacc cacaaagggc 21060 
caggcactgt ggggatagat cagctaacaa aacagttgat gcttcctgcc cttctgggcc 21120 
ttacattttg gctggaagaa gaggggagag gcagactgta agcaataagc gcaataagta 21180 
ggttgcctgg aagtaatgtt agatcacgtt acggaaaaca ggaaagagca gagcgacaag 21240 
tgctggggtg cgtggtgcag ggaaggcagc tggctgctgc tggtgtggtc agagtgggcc 21300 
ctcatggaga agactgcatt cgagcagaaa cttgaagggg gtgaggggtg agcctagaga 21360 
tatctggggc agagcagtcc aggcagaggg gacagccggt gtcaagccca ggacaggagt 21420 
gtgcctggtg tgccagtttc aggcaagagg ccagtgtgca gaggcaaggt gagaacgcaa 21480 
gggagagcag tggcggagac gggtgggaac gaggtcagac ctgctggcct ccagcctctg 21540 
catggggctt ggctcttgct gggagcaatg ggaagcagta cacagtttca tgcaggggga 21600 
gaaggcctgt cttgggttgc aggggcacgc tgtggcagct gggatcagag agaggagctt 21660 
gtaggccagt tgttatgtgg tcccacgggc cagatggcca tggcttacct cacttcaggg 21720 
aggctgtgag aagcactcag aatctggatg tgccttgggg gtgggcccca ctggatttcc 21780 
tggtggacct ggtgtggggt gtgagaggag ggtgtgtttg gctgcagcag acaggagaat 21840 
ggagttgcca tccgcgtgat ggggatggct gtgggaggag aggtttgggg tgagggaatc 21900 
aggaactgag tgctggacat ggcaagtctg aaggcgcagt ggtcgtccac tcagagacct 21960 
tggagttgga gatggaggtg tgggagtcct gaacagttag atgtagtgtt taccgcgaga 22020 
aggaacaggg cttgcggcca gccctcctgt gttcccgtga cccagggcag ggcaggaggg 22080 
gcctgagcct gccgagtgac tgggacctcc ttccaggaga tcatgcacgc cctgaagatg 22140 
acctggcacg tgcactgctt tacctgtgct gcctgcaaga cgcccatccg gaacagggcc 22200 
ttctacatgg aggagggcgt gccctattgc gagcgaggta cccactggcc agtgagggtg 22260 
aggagggatg gtgcatgggg caggcatgaa tccaggtcct ctttctctct gcccccattc 22320 
tcagactatg agaagatgtt tggcacgaaa tgccatggct gtgacttcaa gatcgacgct 22380 
ggggaccgct tcctggaggc cctgggcttc agctggcatg acacctgctt cgtctgtgcg 22440 
gtgagagccc cgcccctcga actgagcccc aagcccaccg gccctctgtt cattccccag 22500 
gagatgcagg agaagttggg aaggggcctc tcctgctgcc cccaacccca tgtgactggg 22560 
cctttgctgt ccttagatat gtcagatcaa cctggaagga aagaccttct actccaagaa 22620 
ggacaggcct ctctgcaaga gccatgcctt ctctcatgtg tgagcccctt ctgcccacag 22680 
ctgccgcggt ggcccctagc ctgaggggcc tggagtcgtg gccctgcatt tctgggtagg 22740
gctggcaatg gttgccttaa ccctggctcc tggcccgagc ctggggctcc ctgggccctg 22800 
ccccacccac cttatcctcc caccccactc cctccaccac cacagcacac cgatgctggc 22860 
cacaccagcc ccctttcacc tccagtgcca caataaacct gtacccagct gtgtcttgtg 22920 
tgcccttccc ctgtgcatcc ggaggggcag aatttgaggc acgtggcagg gtggagagta 22980 
agatggtttt cttgggctgg ccatctgggt ggtcctcgtg atgcagacat ggcgggctca 23040 
tggttagtgg aggaggtaca ggcgagaccc catgtgccag gcccggtgcc cacagacatg 23100 
aggggagcca ctggtctggc ctggcttgga ggttagagaa gggtagttag gaagggtagt 23160 
tagcatggtg gctcatgcct gtgatcccag cactttggaa ggccaaggtg ggcagatcgc 23220 
ttgaggtcag gagttcgaga cctcatggcc aacacggtga aacagcgtct ctagtaaaaa 23280 
tacaaaaatt agccgagtgt ggtggggcat gcctgtaatc ccagccactc aggaggctga 23340 
ggcgggaaaa tcacttgaac ctgggaagtg gaggttgcag tgagctgaga tcacaccact 23400 
gcgcgcgagc ctgggtggca gatggcagag cgagaccctg cttcaaaaaa aaaaaaaaaa 23460 
aaaaaaaaaa gaagggtagt tgtagttggg ggtggatctg cagagatatg gtgtggaaaa 23520 
cagcaatggc cacagcaaag tcctggaggg gccagctgcc gtccaaacag aagaaggcag 23580 
ggctggagag ggtagccctt aggtcctggg aagccacgag tgccaggcag tagagctggg 23640 
gctgtctctt gaggttaggg cagggcaagg cacagcagag tttgaaatag gtttgtgttg 23700 
tattgcagaa aagaggcccc agaacactga gggagtgcag gagggaggct gggaggagga 23760 
gttgcagcag ggcctagggg cgggggccag gcaagggagg ggcagagagt aatatggcag 23820
agatgggacc cagtggcagg tccgggggat gagggatgga gagaaggaca ggagcgttgc 23880 
caggcatctg gcctatacca gacatgctca cgctgtctcc cgcgaacctc ctagcaacct 23940 
tgcgccgttg tctgcaatca cttatttcat tttttctttt ttaactttaa ttttttttgt 24000 
ttttaagaga caggatctcc ctaggttgcc cgggctggtt tcaaactcct gggctcaagc 24060
aattcttcct ccttagcccc aaagtgctgg cattacaggt gtgagccacc atgcctggcc 24120 
cacttatttt ctagatgagg cacagaaaga ttgggagact tgaccaaggt cacgctgtca 24180 
ttgagccatg agccagacta gaatccaggc ctgaagctgg gtgcgctgtc ccaggactgg 24240 
ctggcactga gtaccatttg ccagcgagca tctctctggg aagctgactt ctgcccggta 24300 
cctggaggac tgtagacctt ggtggtggcg ccgtcactct ggggcttcct gcctcccact 24360 
gatgcccgca ccaccctaga gggactgtca tctctcctgt cccaagcctg gactggaaag 24420 
actgaagaga agccttaagt aggccaggac agctcagtgt gccatggctg cccgtccttc 24480 
agtggtccct ggcatgagga cctgcaacac atctgttagt cttctcaaca ggcccttggc 24540 
ccggtcccct ttaagagacg agaagggctg ggcacggtga ctcacacctc taatcccagc 24600 
actttggaag gctgaggctg gagaagggct ccagcttagg agttcaggac cagcctgggc 24660 
aacatggtga gaccctgttt tgttttgttt tttgtttttt tgagatggag tcttgctctg 24720 
tcgcccaggc tggagtgcag 24740 



<210> 42
<211> 25
<212> DNA

<213> Rattus norvegicus
<400> 42

gcactacctt gaaggaatcc atggt 25 
<210> 1
<211> 457
<212> PRT

<213> Rattus norvegicus
<400> 1

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

Leu Ser Arg Ala Gln Pro Ala Gln Ser Lys Pro Gln Lys Ala Leu Thr
85 90 95

Pro Pro Ala Asp Pro Pro Arg Tyr Thr Phe Ala Pro Ser Ala Ser Leu
100 105 110

Asn Lys Thr Ala Arg Pro Phe Gly Ala Pro Pro Pro Thr Asp Ser Ala
115 120 125

Leu Ser Gln Asn Gly Gln Leu Leu Arg Gln Leu Val Pro Asp Ala Ser
130 135 140

Lys Gln Arg Leu Met Glu Asn Thr Glu Asp Trp Arg Pro Arg Pro Gly
145 150 155 160
Thr Gly Gln Ser Arg Ser Phe Arg Ile Leu Ala His Leu Thr Gly Thr
165 170 175

Glu Phe Met Gln Asp Pro Asp Glu Glu Phe Met Lys Lys Ser Ser Gln
180 185 190

Val Pro Arg Thr Glu Ala Pro Ala Pro Ala Ser Thr Ile Pro Gln Glu
195 200 205

Ser Trp Pro Gly Pro Thr Thr Pro Ser Pro Thr Ser Arg Pro Pro Trp
210 215 220

Ala Val Asp Pro Ala Phe Ala Glu Arg Tyr Ala Pro Asp Lys Thr Ser
225 230 235 240

Thr Val Leu Thr Arg His Ser Gln Pro Ala Thr Pro Thr Pro Leu Gln
245 250 255

Asn Arg Thr Ser Ile Val Gln Ala Ala Ala Gly Gly Gly Thr Gly Gly
260 265 270

Gly Ser Asn Asn Gly Lys Thr Pro Val Cys His Gln Cys His Lys Ile
275 280 285

Ile Arg Gly Arg Tyr Leu Val Ala Leu Gly His Ala Tyr His Pro Glu
290 295 300

Glu Phe Val Cys Ser Gln Cys Gly Lys Val Leu Glu Glu Gly Gly Phe
305 310 315 320

Phe Glu Glu Lys Gly Ala Ile Phe Cys Pro Ser Cys Tyr Asp Val Arg
325 330 335

Tyr Ala Pro Ser Cys Ala Lys Cys Lys Lys Lys Ile Thr Gly Glu Ile
340 345 350

Met His Ala Leu Lys Met Thr Trp His Val Pro Cys Phe Thr Cys Ala
355 360 365

Ala Cys Lys Thr Pro Ile Arg Asn Arg Ala Phe Tyr Met Glu Glu Gly
370 375 380

Ala Pro Tyr Cys Glu Arg Asp Tyr Glu Lys Met Phe Gly Thr Lys Cys
385 390 395 400

Arg Gly Cys Asp Phe Lys Ile Asp Ala Gly Asp Arg Phe Leu Glu Ala
405 410 415

Leu Gly Phe Ser Trp His Asp Thr Cys Phe Val Cys Ala Ile Cys Gln
420 425 430

Ile Asn Leu Glu Gly Lys Thr Phe Tyr Ser Lys Lys Asp Lys Pro Leu
435 440 445

Cys Lys Ser His Ala Phe Ser His Val
450 455
<210> 2
<211>
<212> DNA

<213> Rattus norvegicus

<400> 2 

gcacga^at cccagcgc^g ctcctggagg 
ggagcaggta cCfttggattc cttcaaggta 
cgtctgcaag ggggcaags* cttcsacgtg 
ggcaaggccg cacaggccgg tgtggccgtg 
aacgcc*ggaa gcctcacaCA cattgaagcc 
Ctcagcctgg gt-Cttagcag agcccagcct 
cctccd£CC?g accccccgag gt&cactttt 
eggccctfcog ^gcaccccc acctactgac 
agacagctgg tccctgatgC cagcaagcag 
cogcggccag ijgacaggcca gtcccgttcc: 
gagtfccstgc aagacccgga t^ggaattc 
gaagcccca£ ceccagcctc aaccatsccc 

gacaaaacca geacagtgct gacccgatfAC 
A AC agcacct. ccata gttca ggetzgc agct 
ggcaagaegc ctgtatgcca ccagtgceae 
ctgggccaog cgfcaCCSLtcc tgaggaattt 
gagggtggtffc tcttcgagga gaagggagct 
t^fcgcaccca gctgtgeeaa atgcaagaag 
aagatg&cct ggcatgttcc ctgcttc^co 
agggctttct ac^tggagga ggggge-tccc 
ggcacaaagt gtcgcggctg tgactteaag 
ctgggtttca ^etggcatga tacgtgtttt- 
ggaaagacct tctactcCAA gaaggacaag 
gtatgagC&c etcctcacac tactgcci&ce 
gagrtctct*; tCQctC^ace tttctgggtg 
cctg^ccaga tcctggggct ccctcctsae 
accacogtca ctd&caggtg ctagDctcct 
cctgtatgta gctgtg 



ccgccagsca £C<^ccc*gc egggcattca sn 
gtgctggagg gacctgcccc ttggggcttc 120 
cccctctccA tctctcggct cactcctgga 180 
cf^gactggg tactgagtat cgacggtgag 23<0 
cagaacaagA tecgtgcctg tggggagcgc 300 
geteagagca aaccacagaa ggcccfcgacc 360 
gcaccaagcg oetCCCtcaa caagacggcc 42 D 
R§cgccetgt cgcagaatgg acag-ctgctc 430 
cggctgatgg a-^aataetga agactggcgc 54 D 
ttccgoatcc ttgct-cacct cacgggcaca 600 
atgaagaagt caagccaggt geeoaggaca 650 
caggaatcct ggcctggccc caccac^ccc 720 

r»iy.i/if,-Tn» f h « ^.^.n-a.^rr-n r*t n |-rrr*.".r?r«,o T R i"N 

34UWVVg»r«U «r — ^— J — — _-j . 

agceagccag cca cacctac gcctctgCA£ 340 
ggagggggca caggaggagg cagcaacaaE 3 DO 
aagatcatcc gcggccgata cct£r3t&9C6 960 
gfcgtgc-agCO aqtgtgggaa ggtcctggaa 1020 
atcttttgcc cctcctgcta tgatgtgc£c= 
aa^atcactg gagagatcat gcatgcgctxg 11 4D 
tgtgcagcct geaaaatfcdC tatccgcaae 120D 
t»ct§cgagc gagattacga gaagatgttfc 1260 
atcgatgccg gg^ACC^ttt cctggaagcc 13 2D 
gtttgcgcaa tatgtcaaat caactfrggaa 13S0 
cccDtgtgca flgagceatge cttttcccac 1440 
etactctgcc agaagggtga taaaatgaga 1500 
gggctggcag eeatfcgtcct agccttggct 15 BO 
agtccccttt cccacactte etccaccaccs l£20 
agccCCAgtt cactetggtg tcacaataaa l€S0 



<210> 3
<211> 260
<212> DNA

<213> Rattus norvegicus
<400> 3

ttctacat^g aggAgggggo tccctsctsc 
Aagbgtcgcg gctgtgactt c«^gatc?gat 
ttcagcfcsge- atgatacgtg ttttgttt^c 
AC^ttctact ccaa^aagga cM^Coactg 
gcacctcctc ACACtaetgc 



gagcga^^tt acgagaagat gtttggcaca 60 
gccggggacc gbttcct^A ^gccctgggt 12 D 
gc^^t^tgtc aaatcaact t ggaaggaaag loD 
tgcaagagcc at-sccbtttc ccacgtatga 240 

260 



<210> 4
<211> 16
<212> DNA
<213> MMLV

<400> 4

aagctttttt tttttg 16



<210> 5

<212> DNA
<213> MMLV

<400> 5

aagcttggct atg 



<210> 6
<212> DNA

<213> Homo sapiens

<400> 6

atcettgste scctcacgsg es^sg^gtts stgeaagiifrc cggatga^g?. oracctg^g £Q 
^aatcsagcc aggtgCtf^ag gacagaagcc ccagccccag c^t-Cab-Ctac sccccaggag 120 
ccctggwtg gccc tBccgc cc ccagccct aeesgccgcc ogccctgggc tgtggaccct 
gegtttgcog agcgct-.atgc cccapcaaa accagcscag bg« ~ ~ 225 



<210> 7

<211> 717

<212> DNA

<213> Homo sapiens

<400> 7

atggattcct tcaaggtagt gcfcggagggg 
ggcaaggact f.tfaatgtgcc cctctccafct 
caggccggag tgsccgtgsg tgactgggtg 
ctcacttC*ca tcgaagcfcca gnctQ993 A ^ 
etcagcaggg ccc-a^CC^gt tcagagcaaa 
cctccg^ggt acacctttgc acQc»gc<gt<j 
gcgcccccgc ccg^tgacag cgccccgcaa 
cc^grttgcca gcaagcagog gcttsatggag 
acaggccagt cgc^Utectt ccgcatcctt- 
gasctr^gatg aggagcacot g-Tm-gaaatca 
ccagcctcat ct*<2acccea ggagccctgg 
ogdccgocct gggctgtgga ccctgagttt 



ccagcacctt ggggcttccg e<iLgcsaggg 50 

tet?eggetca ctcctggggg caaa-gcggcg 120 

ctgstgratpg ^fcggcgagaa tgcgggtagc I BO 

cgggcctgcg gggagcgcct. cagcctgggc 240 

cogcag^agg cCfc<:c?gcccc cgccgcggac 3 DO 

tccctcaaca agacggcccg gwct bfcggg 36 G 

caqaatgg»a agccgctccg accgct^gtc 420 

aacacagagg act^ggcggcc gcg^cegggg 46D 

gcccac?t<2a caggcaccga gttcatgcaa £4 0 

agccaggtgc ccaggacag* iss^cccagcc 6 DO 

cctg^CCCUa ccgcccccag ccctaccagc $£0 

gccgagcgct atgccccgga cart«.*rtg 717 



<210> 8

<212> DNA

<213> Homo sapiens

<400> 8

stcgatggcg ogjsAtgcggg tagccfccaca 

Lgcggggagc gcctcagcct gggcctcagc 

aaggcctwH cecccgccgc ggaccctccg 

aacaagacgg cccggccctt l.ggggcgccc 

gg^Qjigcegc tccgaccgct ggtccwg^t 



c^catcg?!^ iiLcag2acaa gatccgggcc 60 
agggcccagc cggttcag^g t?A*ac*cgcag 12 D 
cggfcaoaect ttgcacccag cgtctccctc 180 
ccgcccgct^g acagcgcc^o geaaeagaat 240 
gccagcaagc agcggcbgat ggagaacaw 3 00 
gaggactggc ggcc^cggcc ggggacaggc
ctc&^aggca ccgagttcat gcaagace^g 
gtgcccaggR ^agaagcccc atgccccagcc 
cctaccgccc ccsgccctac cagccgcccg 
cgctat^crtfC! c^g*caa*ac £*gcacagtg 
acgccgctgc ag.sgccgcac ctrcattgtg 
ggczigcaae*. Acggcaagac tcc-cgtgtgt 
tacctggtgg ogttgggcca cgcgtaccac 
aag^tcctgg aagagggtgg ettctttgag 
tatgaogtgc gctatgcacc cagctgtgcc 
atgcatjgcce tgaagatgac etg^c^gtg 
cccatccgga acsggocctt. ctacatggag 
gagaagatgt ttggcaogaa at^ccatggc 
ttcctggagg ccctgggcfct cagctggcat 
Atc^acetgg aaggaaagac ctt^ tactic: 
gccttctctc abgtgtgagc cccttctgcc 
gggcctggag tcgtggccct gcatttotgg 
gctcctggcc cg*gC*e*-ggg ctccogggcc 
ctccctccac caccacagca caccggtgct 
ccacaata^A eotQlzcct-A gctgaattcc 



cagtcgcgtt ccttccsgcat ccttgcccac 360 
gat0Aggagc asctgaagaa atcaageoag 420 
tea tcta cac cccaggagoc? ctggcctggc 4 BO 
ecefcgag^tg tggacectgc gtttgecgag 540 
ctgacccggc acagccagcc ggcc^c^occ 600 
caggcagctg cc$£*ggggt gecaggaggg €60 
caocagtgcc acaaggtcat ccg^ggccgc 720 
ccggaggagt ttgtgtgtag cca^tgtggg 7B0 
gagaagggcg ocatcttctg cccaoeatgc B40 
aagtgcaaga agaagatt&O aggegagate 900 
c-actgettba cctgtgctgc ctgcaagscg- 9S0 
gagggcgtgc ectattgeg* gcgagactat 1020 
tgtgacttca agatcgaege tggggaedgc 10 30 
gacacctgct tcgtetgtgc gatatgtcag 1140 
flagaaggaca ggcctctctg caagagecat 1500 
cacagctgcc gcggtgg t«c;c et*gcctgag 1260 
gta^ggotgg TOcitggttgc cttaaccctg 1220 
cc^Dcccacc caccttatcc teee-Accsca l2t$0 
ggecacfacCA gwccctttc acctccagtg 1440 
staaaaatcca aaaaaaaa 1468 



<210> 9
<211> 1644

<213> Homo sapiens

<400> 9

atggattcot tonaogtagt gctggagggg 
ggcaaggact t-saatgtgcc cctcrtccatt 
c*ggccg-3*g tggcc^tggg tgaetgggtg 
ctcacacaca tcgaagDtca gaacaagatc 
etcaged^gg r^p^ceggt fccr<gngcaaa 
cctccgoggt acacctttgc acccagcgtc 
gcgcecetjge e^ctgac^g Ps^cccgcnc* 
ccagatgcca geaagcageg gctgatggag 
Aea^gccagL egcsgffct^ctt ccgcatccfct- 
gaeceggatg aggagcacct gaagaaatca 
Ctfiagctftqal; eta eae* cca ggagcnctg^ 
cgcccgccct sggctgtgga ccctgcgttt 
acagtgetga cccggcaca^ oeagffrtig^Qc 
attgtgcagg cagctgccgg aggggtgeca 
gtgtgtcacc agtgccacaa ggt^afceegg 
taccacccgg aggagtttgt gtgtagccag 
tttgaggaga agggegecat cttctgecca 
tgtgccaagt gcaagaagaa ga fctacaggc 
cacgtgcact gctttacctg tgetgeetgtf 
^tgg^SgaSST gcgcgcccta ttgegagega 
ca tggctgtg acttcaagat cgaegetggg 
tggcatgaca cctgcttcgt ctgtgDgata 
tactccaaga aggacaggetf kckatga&tiQ 
tctvgcccaca gctgccgcgg tggcccctag 
ttctgggtag ggctggcaat ggttgcctta 
egggewctg ccc&cQcacc ttatcctccc 
ggtgctggcc acaccagccc cctttcacct 



ccagcacctt ggggcttccg g^^gc-^ggg 60 
tecc^g^bca ctccfcggggg caaageggeg 12D 
ctgagcatog atggegagaa tgegggta^e 1&0 
eqggeetgtiQ gggagc^gect cagcctgggc 24D 
cegcagaagg cct ccgcccc cgcegeg^A^ 300 
tccctcaaea agacggcccg ^occtttggg 36 0 
cagaatggac agccgctccg acegct^gi-.a 43 0 
aacacagagg actggc^cc seggcogggg 4a D 
gcccacctca caggcaccga gttc3tg<raa 54 0 
agccaggtge ccag^aoaqa agccccagcc 600 
octggccc ta cogcccccag ccccaceagr: 660 
gccgagcgct atgeeccgg« raaaacgagc 720 
.^cgcccacgQ egctgeagag ccgcacctcc 790 
99 a S^D9 ca geaacaaegg fjaag^cbccc 840 
SgCcgctrtcc tggt^gogtt gggccacgcg 900 
tgtgggaagg tcctggaag^ ^gt^gcttc 960 
cir2ihgot^tg aegbgegcta tgcacccagc L02D 
gagatcatgc acgccctgaa gat^cctgg lOSD 
ftagAcgCccci tcc^gaacag ggccttctac lldD 
gactatgsga agatgttcgg ca^ga^^tgc L200 
gacggcttcc tggaggcect gggcttcagc I26D 
tgtcagatca acctggaagg AaA^Accttc L32D 
Agceobgcct tctotcatgt gtgagcccct 13&D 
cctgaggggc ctggagtcgt ggc^tg^t L44D 
acDfiliggct.^ ctggcccga^ cctgggctcc 150D 
accccactcc ctccaccacc acage?acacc 1560 
ecagtgcciiSic ji»taaacctg tacccagctg 1G20 

1644
<210> 10
<211> 457
<212> PRT

<213> Homo sapiens
<400> 10

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

Leu Ser Arg Ala Gln Pro Val Gln Ser Lys Pro Gln Lys Ala Ser Ala
85 90 95

Pro Ala Ala Asp Pro Pro Arg Tyr Thr Phe Ala Pro Ser Val Ser Leu
100 105 110

Asn Lys Thr Ala Arg Pro Phe Gly Ala Pro Pro Pro Ala Asp Ser Ala
115 120 125

Pro Gln Gln Asn Gly Gln Pro Leu Arg Pro Leu Val Pro Asp Ala Ser
130 135 140

Lys Gln Arg Leu Met Glu Asn Thr Glu Asp Trp Arg Pro Arg Pro Gly
145 150 155 160

Thr Gly Gln Ser Arg Ser Phe Arg Ile Leu Ala His Leu Thr Gly Thr
165 170 175

Glu Phe Met Gln Asp Pro Asp Glu Glu His Leu Lys Lys Ser Ser Gln
180 185 190

Val Pro Arg Thr Glu Ala Pro Ala Pro Ala Ser Ser Thr Pro Gln Glu
195 200 205

Pro Trp Pro Gly Pro Thr Ala Pro Ser Pro Thr Ser Arg Pro Pro Trp
210 215 220

Ala Val Asp Pro Ala Phe Ala Glu Arg Tyr Ala Pro Asp Lys Thr Ser
225 230 235 240

Thr Val Leu Thr Arg His Ser Gln Pro Ala Thr Pro Thr Pro Leu Gln
245 250 255

Ser Arg Thr Ser Ile Val Gln Ala Ala Ala Gly Gly Val Pro Gly Gly

260 265 270

Gly Ser Asn Asn Gly Lys Thr Pro Val Cys His Gln Cys His Lys Val
275 280 285

Ile Arg Gly Arg Tyr Leu Val Ala Leu Gly His Ala Tyr His Pro Glu
290 295 300

Glu Phe Val Cys Ser Gln Cys Gly Lys Val Leu Glu Glu Gly Gly Phe
305 310 315 320

Phe Glu Glu Lys Gly Ala Ile Phe Cys Pro Pro Cys Tyr Asp Val Arg
325 330 335

Tyr Ala Pro Ser Cys Ala Lys Cys Lys Lys Lys Ile Thr Gly Glu Ile
340 345 350

Met His Ala Leu Lys Met Thr Trp His Val His Cys Phe Thr Cys Ala
355 360 365

Ala Cys Lys Thr Pro Ile Arg Asn Arg Ala Phe Tyr Met Glu Glu Gly
370 375 380

Val Pro Tyr Cys Glu Arg Asp Tyr Glu Lys Met Phe Gly Thr Lys Cys
385 390 395 400

His Gly Cys Asp Phe Lys Ile Asp Ala Gly Asp Arg Phe Leu Glu Ala
405 410 415

Leu Gly Phe Ser Trp His Asp Thr Cys Phe Val Cys Ala Ile Cys Gln
420 425 430

Ile Asn Leu Glu Gly Lys Thr Phe Tyr Ser Lys Lys Asp Arg Pro Leu
435 440 445

Cys Lys Ser His Ala Phe Ser His Val
450 455



<210> 11 
<211> 22 
<212> DNA

<213> Rattus norvegicus
<400> 11

gccagggttt tcccagtcac ga 22



<210> 12 

<211> 22 

<213> Rattus norvegicus
<400> 12

gcca-gggttt tcccagtcac yix 22 



<210> 13
<211> 22
<212> DNA

<213> Homo sapiens
<400> 13

tcttagcaga gcccagcctg ct 22 



<210> 14 

<211> 22 

<212> DNA

<213> Homo sapiens 



<210> 15
<212> DNA

<213> Rattus norvegicus
<400> 15

atccttgctc acctcacggg 20



<210> 16
<211> 22
<212> DNA

<213> Rattus norvegicus



<210> 17
<211> 23
<212> DNA

<400> 17

catggattcc ttcaaggtag tgc 23

<210> 18

<211> 20

<212> DNA

<213> Homo sapiens



<400> 14

gcatgaactc tgtgcccgtg ag 22



gcactgtgct ggttttgtcc gg 22



<400> 18

gttttgtctg gggcagagcg 20



<210> 19 
<211> 44 
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: adaptor for
Marathon RACE reactions

<400> 19

ctaatacgac tcactatagg gctcgagcgg ccgcccgggc aggt 44



<210> 20
<211> 27
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: PCR primer
specific for Marathon RACE adaptor

<400> 20

ccatcctaat acgactcact atagggc 27



<210> 21

<212> DNA

<213> Homo sapiens

<400> 21

cogttgtttg toA«c<gac^ 
atggattcct tcaaggtagt 
ggcaaggact tcAatgtgcrc 
eaggccggag tggccgtaag 
ctcACAcaca togaagctca 
ctcaacaggg cccagccggt 
cetcegcggt acacctttgc 
gcgcccccgc cogctgacag 
ccagAtgco gcaagcagcg 
acaggccagt gccgttcctc 
gacccggfrts aggagcacct 
ccagcctcat ctacacccoa 
cgcccgcwt gggctgtgga 



cagagcagcg ccctggcc^g 
gctggagggg ccagcacctt 
Ctcetceatt tcccggctca 
tgactgg^tg ctgj?°catcg 
gaacaagatc cgggcctgcg 

accaagogtc tccctcaaca 
cyccevgc&g cag*atgg*<? 
gctgatggag aacacagagg 
tcgcatcctt 9ctc&cc?ttfl 
gaagaaatca agccaggtgc 

ccrtgcgttt gcogagogct 



gccaagcagg agccggcatc 60 
g^ggcttccg 3Ct&eMg$$ 120 
cctctggggg caaggccgtg ISO 
atggogagAA tgcsggtagc 2 40 
gggagcgcct cagccbgggc 3 00 
CCt^CJgcccc cgecgeggac 360 
agaoggcccg gcccttgggg 420 
agcegetocg - acegctggtc 4 BO 
actggcggcc gcggccgggg 540 
ea^gcaccga gtteatgoaa 600 
ccaggacaga agccccagcc 6€0 
ccgeedccag cectaccagc 720 
atgcc 765



<210> 22

<211> 16B3 

<212> DNA

<213> Homo sapiens

<400> 22 

cgacgcagag cagogccotg gccgggccaa 
gtagtgctgg aggggccagc accttggs^d 
gfcgceeotet ccatttcccg gctcactcct 
gtgggtgact gsgtgctgag catena fc^gc 
gctcagaaca agatccgggc ctgcggggag 



gcaggagccg gcatcstgga ttccttcaag €0 
ttdCgcrctgc *ag0gggeaa ggac=£toaat 120 
gggggcaaag cggcgcaggc cggagtggcc 130 
$agaatgC9g gt-age^ttfac* aeacatogaa 240 
cgcctcagcc tgggcctcag cagggcccag 300 
ocggttcaga gc**accgca gaaggcot cc
ttt^fCACCoa gcgtctccct caacaatgacg 
gacagcgcce c^C^aeagaa tggacagcog 
cagcggctga tggagaacac a^Si^A^t^-gE 
tccttccgciA tCCttgccea cctcacaggc 
cacetgaaga aatcaagcc* ^stgccc'A^ 
ccccag^gc eofcggcctgg ccctaccgcc 
gtggaccctg cgttt^ceg* gegctatgec 
caca^ccagc eggccacgcc cacgccgctg 
gcaggagggg tgccaggagg gqgcagcaac 
cacem$gtea teeggggccg ctacctggtg 
tttgtgtgta gccagtgt^g gaaggtectg 
gcCAtcttct gcccaccatg ctatgacgtg 
aagaagafcta c^^cgagat catgcaogcc 
dcct^ftgetg cctgcaagac gcccatcogg 
ecctattgcg atgegasacta tgagaagatg 
aagategacg ctggggaccg cttcct-ggag 
ttcgtctgtg cgRtatgtea gateaacctg 
ageycctetct gcaagsgcca tgcctt-cfcct 
cgcggtggcc cctSSC^tsa ggggcetgga 
gcaatggttg ccttaaccct ggcfccotggc 
ccaccttats ctcccaecco actccetcca 
Acrccc 1 1 1 cacctccagt gecacaafc&a 
aaaaaaaaa 

<210> 23
<211> 22
<212> DNA

<213> Homo sapiens
<400> 23

gcactgtgct cgttttgtcc gg



<210> 24

<211> 21

<212> DNA

<213> Homo sapiens
<400> 24

tccttgctca cctcacgggc a

<210> 25
<211> 30

<213> Homo sapiens
<400> 25

tcctcatccg ggtcttgcat gaactcggtg



<210> 26
<211> 28
<212> DNA



gcccccgccg oggacoatco geggtacacc 3£0 
gCCCg^Ccct ttggpgogcc cccgcccgct 420 
ctccgaccgc tggtcccaga tgccagcaag 4B0 
c^crccgc^c cs^^acuwg CCagtcgcgb 540 
accgagttca tgcaagaccc ggatgaggag 6DD 
Acaga-agcce cagcecwg^ ctcat^baca 660 
cccagcccta ccagcogccD gccctgggct. 7.2 G 
oeggacaaaa cgagcacagt gctgac^ccgg 760 
cagagccgca ccbccattgt gcaggcagct 64 G 
aaeggeaaga ctoccgtgtg teaccagtgc 90i> 
gogttgggcc acgcgtacca ccoggaggag 
gaagagggtg gcttctttga ggagaaggge 1020 
cgctafcgcac ccagcbgtgc caagtgcaag LOGO 
ctgaagatga cctggcacgt gcactgcttt lldO 
aacagggccb tctaOatgga ggagggcgtg 1200 
tttggeacga aatgccatgg ctgtgacttc L260 
gccctgggcb t<?*i$cbgg\>A tgacacctgc L320 
gaaggaaaga ccttctactc caagaaggac 13B0 
C : at9tgt$<ig cc?CCttCtgc ceacageterC 1440 
gtGgtggeco tgcatt.tctg ggtagggccg X 5 00 
cc^^cctg^ g^tCC^gC! ceetetCCC'Ac: 1560 
Dcaccacagc acaccggtgc tggccacsDC 1620 
« ccZ$t Wtgctg^tt^z^a^^ZvQ ' i £ 9 0 

16B5 



22 



21 



30 



<213> Homo sapiens
<400> 26

gcccccgccc gctgacagcg ccccgcaa 28



<210> 27

<211> 24

<212> DNA

<213> Homo sapiens

tccttgctca cctcacgggc accg 24



<210> 28

<211> 22

<212> DNA

<213> Homo sapiens

<400> 28

ctaatacgac tcactatagg gc 22



<210> 29
<211> 23
<212> DNA

<213> Rattus norvegicus

<400> 29

gcggctgatg gagaatactg aag 23



<210> 30
<211> 23
<212> DNA

<213> Rattus norvegicus

atcttgtggc actggtggca tac 23



<210> 31
<211> 22
<212> DNA

<213> Rattus norvegicus
<400> 31

tgtgtcgggt cagcactgtg ct 22



<210> 32

<211> 1620

<212> DNA

<213> Homo sapiens

<400> 32

atggattcct tcaaggtagt gctggagggg ccagcacctt ggggcttccg gctgcaaggg 60
ggcaaggact tcaatgtgcc cctctccatt tcccggctca ctcctggggg caaagcggcg 120
caggccggag tggccgtggg tgactgggtg ctgagcatcg atggcgagaa tgcgggtagc 180
ctcacacaca tcgaagctca gaacaagatc cgggcctgcg gggagcgcct cagcctgggc 240
ctcagcaggg cccagccggt tcagagcaaa ccgcagaagg cctccgcccc cgccgcggac 300
cctccgcggt acacctttgc acccagcgtc tccctcaaca agacggcccg gccctttggg 360
gcgcccccgc ccgctgacag cgccccgcaa cagaatggac agccgctccg accgctggtc 420
ccagatgcca gcaagcagcg gctgatggag aacacagagg actggcggcc gcggccgggg 480
acaggccagt cgcgttcctt ccgcatcctt gcccacctca caggcaccga gttcatgcaa 540
gacccggatg aggagcacct gaagaaatca agccaggtgc ccaggacaga agccccagcc 600
ccagcctcat ctacacccca ggagccctgg cctggcccta ccgcccccag ccctaccagc 660
cgcccgccct gggctgtgga ccctgcgttt gccgagcgct atgccccgga caaaacgagc 720
acagtgctga cccggcacag ccagccggcc acgcccacgc cgctgcagag ccgcacctcc 780
attgtgcagg cagctgccgg aggggtgcca ggagggggca gcaacaacgg caagactccc 840
gtgtgtcacc agtgccacaa ggtcatccgg ggccgctacc tggtggcgtt gggccacgcg 900
taccacccgg aggagtttgt gtgtagccag tgtgggaagg tcctggaaga gggtggcttc 960
tttgaggaga agggcgccat cttctgccca ccatgctatg acgtgcgcta tgcacccagc 1020
tgtgccaagt gcaagaagaa gattacaggc gagatcatgc acgccctgaa gatgacctgg 1080
cacgtgcact gctttacctg tgctgcctgc aagacgccca tccggaacag ggccttctac 1140
atggaggagg gcgtgcccta ttgcgagcga gactatgaga agatgtttgg cacgaaatgc 1200
catggctgtg acttcaagat cgacgctggg gaccgcttcc tggaggccct gggcttcagc 1260
tggcatgaca cctgcttcgt ctgtgcgata tgtcagatca acctggaagg aaagaccttc 1320
tactccaaga aggacaggcc tctctgcaag agccatgcct tctctcatgt gtgagcccct 1380
tctgcccaca gctgccgcgg tggcccctag cctgaggggc ctggagtcgt ggccctgcat 1440
ttctgggtag ggctggcaat ggttgcctta accctggctc ctggcccgag cctgggctcc 1500
cgggcccctg cccacccacc ttatcctccc accccactcc ctccaccacc acagcacacc 1560
ggtgctggcc acaccagccc cctttcacct ccagtgccac aataaacctg tacccagctg 1620



<210> 33

<211> 1665

<212> DNA

<213> Homo sapiens

<400> 33

egaegcugag c^gcgccctg gccgggccaa geaggagecg gcateatgga tt.Dcttcasg 60
gtagtgctgg aggggecage a eett^ggd ttccggctge AA^^ggca^ ggacfttcsaat 120
gtgcdCCtct ceatttcceg gctcactcct gggggcaaag eggogcagge cggagtggcc 180
gtgggtgact gggtgc'fcgag catcgatggc gaga^tgegg gtagcct<?ae acacatcgaa 240
gctcagaaca agatceggge ctgeggggag cgcctcagcc tgggcctcag cagggcccag 300
ccggttcaga gcaaaccgca, ga a^gcctCC goccccgccg cggacccfccc gcggtacacc 360
tttgeaccoa gcgtctccct caacaagscg gcccggccct ttggggcgcc cccgcccgct 420
gacagcgccc cgcaaC&gftA tggaoagocg etcegaccg^* tggteccaga tgcc?agoaag 480
cageggetga tggagaacac agaggactgg cggccgcggc eggggacagg ccagtcgcgt 540
tccttccgca tc-cttgeeca cctcaeaggc acogagttca tgcaaga^co ggatgaggag 600
eaeetgaaga aatcaageca ggtgcccsgg scagaagccc cagccccagc ctcatctaca 660
ccccaggagc ccbg^CCt^g o^etaccgee eccagcceta ecageegccc gceetgagct 720
gtggaccctg cgtttgccga gcgctatgcc ccggacaaaa cgagcacagt gctgacccgg 780
cacagccagc <z$$cmcg cacgccgctg cagagccgca cctccattgt gcaggcagct 840
geeggagggg tgecaggagg gggdn^c^AC dt^eegt^tg tc^cc^gt^c; 900
cAC^^ggtca teeggggecg ctacctggtg gogttgggcD acgogtacca cceggaggag 960
tttgtgtgta gccagbgtgg g^Ag&tcCtg g*ag^9g^tg gcttctttga ggagaaggge 1020
gceatcttct gcecaccatg etatgaegtg cgctatgcac ccagctgtgc csagtgcaag 1080
aagaagatta caggcgageat catgcacgcc ctgaagatg^ cctggcsc^ft gc^e^gcttt 1140
Aectgtgotg octgeaagae gcccat-ccgg aacagggect- tctacatgga gsagggcgtg 1200




ee^tafctgcg agogagacta tgagaagatg tttggDacga aatgccatgg ctgtgactto 1260

aagatcgacs ctggggaccg cttcotggag gccctgggct teagetg^ca tgacacotgc 1320

ttcgtctgtg cgatafcgtca gatcaacctg gaaggaaaga ccttctactc caagaaggao 1380

AggC£?tCtet gcaagagcca tgccttctct catgtgtgag cccettcfcge cCAcngctgc 1440

cgoggfcggcc cotagcctga gg^gcctgga gtc^tggcw tocatttctg ggtagggctg 1500

g^aatggttg ccttaaccct ggctcctggc ccgagcctgg gctcccgggc eeobgctfeac 1560

ccaccttatc ctccc^CCCC .^CtCC^t^Oa cc-Ticcacisgc acaccggfcgc tggccac-acc 1620

agcc^ecttt cacctccagt gccacaataa acctgtaccc agctg 1665



<210> 34 
<211> 223
<212> PRT

<213> Homo sapiens
<400> 34

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

Leu Ser Arg Ala Gln Pro Val Gln Ser Lys Pro Gln Lys Ala Ser Ala
85 90 95

Pro Ala Ala Asp Pro Pro Arg Tyr Thr Phe Ala Pro Ser Val Ser Leu
100 105 110

Asn Lys Thr Ala Arg Pro Phe Gly Ala Pro Pro Pro Ala Asp Ser Ala
115 120 125

Pro Gln Gln Asn Gly Gln Pro Leu Arg Pro Leu Val Pro Asp Ala Ser
130 135 140

Lys Gln Arg Leu Met Glu Asn Thr Glu Asp Trp Arg Pro Arg Pro Gly
145 150 155 160

Thr Gly Gln Ser Arg Ser Phe Arg Ile Leu Ala His Leu Thr Gly Thr
165 170 175

Glu Phe Met Gln Asp Pro Asp Glu Glu His Leu Lys Lys Ser Ser Gln
180 185 190

Val Pro Arg Thr Glu Ala Pro Ala Pro Ala Ser Ser Thr Pro Gln Glu
195 200 205

Pro Trp Pro Gly Pro Thr Ala Pro Ser Pro Thr Ser Arg Pro Pro
210 215 220



<210> 35 

<211> 20 

<212> DNA

<213> Homo sapiens

gagccggrat eatggsttcc 20 



<210> 36
<211> 20
<212> DNA

<213> Homo sapiens
<400> 36

getgcctgca c*afcggaggt 2D 



<213> Homo sapiens

<400> 37

cgaegcagag ca^cgt-e-Ctg gccgggccaa gcagga^ccg goatcntsQ* Ltecttcsag 60
gtagtgstgg aggggccagc accttggggc btec^gctgc- aagggggcaa ggacttcaat 120
gtgeccctct ccsttt<?isfia gctcactcct gggggcaaag cggsgn*3g£ cggagtggcc 180
gtgogtsact gggtgct^ag catcg^tggtf gagaatgcgg gtagcctcac acacatcg^a 240
grtcagsacR at?At.ecgggc ctgcggggag cgcctcagnsc f.gggcctcag cagggcccag 300
crcggt-fc^ags gcaaac-D^ca g^^ggh^aag acccctgaca aaca-gccget c.cg-^^gc?Lg 360
gtcccagatg ficagcaagca gcggct^atg gag^^^a^ag aggsctggcg gccgcggccg 420
g$gacaggcc agtcgocfttc ottcogcatc cttgcccacc trac?iggcJ": rsgagttcatg 480
caagacccgg at £T a 9£J a 9 ca cctgaagaaa tOAAg^cagg tgcccaggac agaagcOTca 540
Sccercagcct ca fectac* oa ccaggagccc tggcctggcc ctc»<7tfge£?ec' cagccctacc 600
agccgoccHt- cctgggctgt ggaccctgc*? tttgccgagc gctatgcccc ggaceitfaacr-j 660
^gcacagtgc tgacccggw eagccragccg gccacgccca cgccgctgfta gagecgcacc 720
tccattgtgC aggcagctgc csgag^ggtij c^aggagggg gcagcaacaa cggcaagact 780
c£?egtgtgtc accagt.gcw eaaggtcatc oggpgcogct ncctggtgs^ gttgggccac 840
gcgtaroace cggaggagtt tgtgtgtsgc cA^ttgtggga aggtcctgga EtgagggtggQ 900
fctctttgagg agciaggg c*?c eatcttctgc ccaccatgct atg^v^tgog ctatgcaccc 960
agctgtgccst agtgcaagaa gaagattaco ggpgagatca tgcacgccct gaagatgacc 1020
tggcacgtgc actgctt t»rf etgtgctgcc tgcaagacgc ccatc^gsaa cagggccttc 1080
tacsteg^gg agggcgtgcc ctattgogftg c*gagactatg agaagaOgt t tggc^cjgowm 1140
tgccatggct gtgacttCAa gatcgaogct ggggaccgct tcctggaggc* cctgggcttc 1200
agcbggc^tg acacctgctt cgtctqt^og atatgtcsga tcaacctgga aggaaag*3tf<-* 1260
ttctactcca ag^-^gaeag gcctctctgc aagagccat?; iM?tLctctca tgtgtgsgcc 1320
ccttctgficc acagctgccg c^gtggrcaec? tagcctgagg ggcctggagt c*?tgg«e£*t.g 1380
catttct^gg t..>gggctggc aatggttgcc ttaaccctgg tftectggccc gagcctgggc 1440
tcccgggccc tg ccca 1456



<210> 38
<212> PRT

<213> Homo sapiens
<400> 38

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

I-*eu Ser Arg Als Gin Pr£ V£ 1 Q\Tj Abu Lys Pro "Sin Ly3 Veil Gin Thr- 
H5 *0 95 



Pro Asp Lys Gin Pro Leu Arg Pro Leu Val Pro Aep Ala ser Lys Gin 
100 105 110 

Arg Leu Met Glu Aeti Tbr Glu Asp Trp Arg Pro Arg Pro Gly Thr Gly 
115 120 125 

Gin ser Arg Ser Pho Ar$ 3 He leu Ala Hia Leu Thr Gly Thr Glu Ph* 
130 135 1*10 

Met Gin Asp Pro Asp Glu Glu Kie Lbu LyS l^e Ser Ser Gin Val Pro 
145 150 15S 160 

Arg Thr Glu Ala Pro Ala Pro Ala Ser Ser Thr Pro Cain Glu Pro Trp 
165 170 175 

Pro Gly Pro Thr Ala Pro Ssr Pro Tftr Ser Arg Pro Pro Trp Ala Val 
ISO 1B5 390 

ABp Pro Ala Pb* Ala Glu Arg Tyr Ala Pro Asp VyS Thr Ser Thr Val 
1S5 200 205 

Leo Thr- Arg Hie- Ser Gin Pro Al* tfhr Pro Thr Pro Leu Gin Ser Arg 
210 215 220 

Thr 3« ILe Val Gin Ala Ala Ala Gly Gly V^i Pre Gly Gly Gly Ser 
225 230 235 240 

Aan ABn Gly Lys Thr Pro Val Cya Kia Gin Cys His Gin Val ELe Arg 
245 250 255 

Ala Arg Tyr Leu Val Ala Leu Qly Kie Ala. ryr Hie Pro Glu Glu Phe 
260 265 270 

Val Cys Bex* Olu Oys Gly Lya Val Leu Glu <51u Oily Gly P^e Pfcft Glu 
275 280 285

Glu Uvb Qly Ala lie Phe Cye Pro Pre CyB Tyr Asp val Arg Tyr Aia 
290 2£S 30D 

pro ser eye Ala Lye Cye hyz Ly& Lye He T'hr Gly Glu He Mfct 3ie 
305 210 3i5 320 

Ala Leu Ly* Wet Thr Txp Hia val lbu C*/3 Ri* Thr Cya Ala Ala Cys 
325 330 335 

Ly* T-hr Pro lie Arg Asii Arg Al* J?he Tyr Met Glu Qlu Gly Val Pro 
3<10 345 ^50 

Tyr Cyfl Qlu ABp Tyr Glu iya Met Pbe Gly Thr Lys Cys Gin Trp 
555 350 365 

Cye A Bp Phe Lys Ilfc Aep Ae P Ar 3 phe Leu Glu Ala Leu Gl V 

3*70 375 390 

Phe Trp Hie A Bp Thr Cya Vh* Val Cy£ A2a 31* CV« Sin He Asn 
30S 320 3$S 400 



Lbu Glu Gly l^ye tfbr Phe Tyx Ser lya Lya Asp Arg &ro Lbu CyB Lys 
405 01D 415 

Ear h±b ALa Phc Scr Hi* Val 
420 



<210> 39

<211> 1575

<212> DNA

<213> Homo sapiens

<400> 39

egaegcagag Cfrgogccctg gccgggccaa geaggagecg gcatcabgga ttccttc^&g 60
gtagtgctgg a^gggecage acc-ttgg^gc ttccggctgc aagggggcaa ggacttcaat 120
gtgccccfcct ccatttceeg gctcactcct gggggcaaag cggcgcaggc cggagtggcc 180
gtgggtgact g^gtgetgag cafcc?gAtggC! gagaatgegg gtagcctcsc acacatcgaa 240
gctcagaaca agatceggga ctgeggggag Dgcctcagcc tgggcctcag cagggcccag 300
ccggttcaga gcaaaccgca gaaggccfcCC gccceegcfeg cggaccctcc gcggtacacc 360
fctfcgca CCC a gogtctccct caacaagacg gcccggccct tbg^ggc^cc cccgtfecgct 420
gacagcgccc cgcaacagaa tgjgtgeaga cccctgacaa acagcogctc cgaccgctgg 480
tcdC^gAtgc cagcaagcag eggctgatgg agaac*ciiga. ggactg^cg^ ^eggcegg 540
ggacaggeca gtcgdgttecr ttc?egeatcc ttgcccacct cacaggcacc gagttcat^c 600
a<igaceegga tgaggagcac ctgaagaaafc ca5»gcc<igBt goc<?*fifgd<«a gaagecceag 660
ccccagcctc atctacACC? eaggagcect ggDctggccc taccgccccc agccctacca 720
gecgcccgcc ctgggDbgtg gaccctgdgt ttgccg^geg etatgccccg gacaaaacga 780
geacagtget gaeo<?ggcac5 sgccagccgg ccacgcccac gccgctgcag agccgcacct 840
ccattgtgca ggcagctgcc ggaggg$tge caggaggggg cagcaacaac ggcaagactc 900
ccgtgtgtca ccagtgccae aaggtcatcc 9995^cgcta cctggtggcg ttgggCcacg 960
ogtaccaccc ggaggagttt gtgtst&gcc! agtgtgggsa ggtcctggaa gagggtggcC 1020
fcctttg$g£& gaagggcg cc atcttctgcc caccatgcta tgacgtgogc tz>.tgdo<Cci» 1080
gctgtgccaa gtgceiagaag «^Attacag gegagatcat gcaogccctg aagatgaccb 1140
^gc&cgtgca ctgctttacc tgtgctgcot geaagacgee dat-CC^g^d *egge«tt<2fc 1200
acatggagga gggcgtgccc tsttgcgagc gagactatga gaagaogttt ggcaogaaat 1260
gecAtggetg tgacttcaag ategacgctg gggscegctt cetggaggee ctgggettCA 1320
gctggcatga cacctgcttc gtcfcgtgcga tatgtcagat caacctggaa ggaaagacct 1380
tct^etccaa gaaggacagg cctctctgca agagecatgc cttctctcat gtgtgagce^ 1440
cttctgccca cagct-gccgc ggtggccccfc agcctgaggg gcct-ggagfcc gtggccctgc 1500
At-ttct^ggt agggctggca atggttgcct taaecctggc tcctggcccg agccLgggot 1560
cccgggccct gccca 1575



<210> 40

<211> 153

<212> PRT

<213> Homo sapiens

<400> 40 

Met Asp Ser Phe Lys Val Val Leu Glu Gly Pro Ala Pro Trp Gly Phe
1 5 10 15

Arg Leu Gln Gly Gly Lys Asp Phe Asn Val Pro Leu Ser Ile Ser Arg
20 25 30

Leu Thr Pro Gly Gly Lys Ala Ala Gln Ala Gly Val Ala Val Gly Asp
35 40 45

Trp Val Leu Ser Ile Asp Gly Glu Asn Ala Gly Ser Leu Thr His Ile
50 55 60

Glu Ala Gln Asn Lys Ile Arg Ala Cys Gly Glu Arg Leu Ser Leu Gly
65 70 75 80

Leu Ser Arg Ala Gln Pro Val Gln Ser Lys Pro Gln Lys Ala Ser Ala
85 90 95

Pro Ala Ala Asp Pro Pro Arg Tyr Thr Phe Ala Pro Ser Val Ser Leu
100 105 110

Asn Lys Thr Ala Arg Pro Phe Gly Ala Pro Pro Pro Ala Asp Ser Ala
115 120 125

Pro Gln Gln Asn Gly Cys Arg Pro Leu Thr Asn Ser Arg Ser Asp Arg
130 135 140

Trp Ser Gln Met Pro Ala Ser Ser Gly
145 150



<211> 24740

<212> DNA

<213> Homo sapiens 

<400> 41 

nnnnnntgta ttt tat cats ttttaasaat caaasaacaa aaggcagttg aggttaggca 60 

tg^agstfcrtcj l:gtfafcgt*$t i2c«a<jr!At'Lt Lgggaagoeg aagcacgtgg at.caoctgag 120 

gtcaggagtt cgagaccagc ctgcccaata tggtaaaacc ctgtctctac taaaaataca 180 

Aaaaattagc saggeaLggt: ggtgggeace tgtaatccca gctacttggg agaotgagga 2<10 
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aggagaatca cttaaACC'csg ggaggcgggc tgggcgcggt ggctcatgcc tgtaatccca SOO 
gcactttg^g aggccgagac aggcggatca tgaggteagg agatcgagat catcetggot 360 
aacatggtga aaccccatct ctaqtaaaaa tacaaaaaaa attagccagg cctsgtggcs 420 
ggc&CCtgta gtcccagcta cttgggag^C tgaggcagga gaatggcgtg aacctgggag 4 BO 
gaggcqttgc Rgfcga-gccaa gatcgcgcca ctgcactcca secfcgggcga caagagtgatj 5^0 
actccatctt aaagaaaaaa aacaaaceeei g^aggcggaa attgcagtca gccgagatct 600 
cgocattgca ctdAAgtatg ggtgacagag caagactcca tgtcaaaaaa aaaggcagtt 6S0 
gacaggagicA aggagcetgg tga^gaagct gtggc&tttg scccggctgt gttgctatgg 720 
cjccagggtgg tgctagtAS* ggagctgagt gggaaagagc acaggggaca tgctgaaggc 7 BO 
ctgggfcgtgg ggatgaggca gagattg^gg gcacettgca gggtoatage aggtggotgt 84 D 
ggtgagat^g aggaAg&cao ctggggtact gctctaggct gtcagacata cagaagctgg 900 
cccagecaag occaggggct gcaagg-gaca tcefctttgtg tcccc^gtga tctgcagctc 960 
tcagacaccc tcaaacacag tgcotcttgc ccagcccagc acfcctcagbg gggagctf&eig 1020 
tgggagaaca ggctcggaag gggscCtagg cttatgcagc gagcogggca aagctggaac 1000 
tggagecca^ gccectggat gccccctggc ttgt^gsigtt ctoggat^Ct gaggggaggg 1160 
gacagggcat gggagfcgcgg tgctctcacc ttbgacttga actcattccc caggggacag 1200 
gggaggcetc etcaggatcc acagatgccc agtctcccaa gaggggcctg gtccccatgg 1260 
aggaaaactc catctACttfC! tCCtggcagg aaggtaagtt ggaggacgtg caagggcagc 1330 
ctcagcccc^ cacaeccagg gctgggtctt ttt^ctgaetg aeggagofcgt ectggccacc 
tgeeacagtg ggcgagttte ecgtggtggt gcagaggact gaggccgcca ccogct-gcea. D 
gctgaagggg ccggccctgc tggbgctggg cccagaegoc? atccagctga gggaggccaa 1*00 
ggeacccagg ccctct&<?&g fttggccctac cactfccctgc gcaagttcgg ctccg*e$>ag 1SSD 
gtgaggtgca^ggggtgggaa^aQggtgaggfcr gctgacagcc- tggaccctcc tgctaatccc 16£0 = 
cacccgtgtg ccJCtgtgcec agggcgtgtt ctcctttgag gccgsce^tc getg^eaotc 16 so 
gggtgagggc ctctttgcct tcagcaccce etgtgcccct gacctgtgca gggcfcgtggc 1740 
cggggccatc gccgccageg ggagcggctg ccaga^ctga ccaggeccca g^ectgcccc 18 DO 
Ctgccacggg ccacctcfcct gsCCtCCctg gacacccccg gagagcttcg ggagatgtfC* 1S€Q 
ccaggaccfcg agcoacccac gtccaggaaa atg^aectgg cegagcccgg accccagagc 1520 
ctgccgctac tgctaggCCC otgagoccaac gatctggogt ccgggctcta cgctteagtg l^Bfl 
tgcaagC^tg ccagtgggcc cccaggcaat gagcacctct atgagaacct gtgtgtgctg 2040 
gaggccagcc ccacgctgca cggtggggaa cctgagcogc acgagggccc oggoa^cgC 2J-QO 
agccceacaa ccagtoccat ctaccacaac ggcCJA^gact fcgagotggec cggcccggcc ^L60 
Aa^gacagta ccctggasgc ccagtac^gg cggctgctgg agctggatca ggtggagggc 2220 
acaggccgcc ctgaccctca ggcaggtttc «aggccaa$<2 tggtgaacct gctgagtcgt 2280 
^agcggagga agggcccagc cCCttgtgac cggcccbgsa ogcccagcag a$t$$tgz*Q 23^0 
agacrggg*9 a ggtgctcccc ctgggacagg ^gtrgtgggct ggtgggcaaa cattgggccc 2400 
atgcagacac acgcctgtgfc ccaocctggc ctgcaggaao aaggcaggcc gcct^tggag 2d60 
gacctcagce ctgccctgcc ctcctcatga atagtgtgca gactcacaga taataaagcfc 2S20 
cagagcagct occggcagggf gcactcacgg cacaogcccc tpcccftcgtt catfcgcggec 2580 
AAcacaagca ccctgtgccg gttcc^gggg cacaggtgac ctgggcctta cctgccaccc 2640 
gt^ggctcaa acccactgea gcagacagac gg^atg^^^a tcattagga^ tccatgttgc 2700 
tctgcacggc cgagbgacac gaogAgg c?ag gcggagggag ctgtgaggct tacttgtca^ 27^0 
actcaggaa^ gageaacatg agggcccaac tggagac^cg g^ggcocsgag ctgggaggag 2B2D 
gcagtggggg oggggtgcag gtggawggga tttcagagac accctcgtcc aaaacacbtg 2&80 
ttccctgctg A^^Ctccaae aatttgca^a tacttctg^g ^ACCCCaggC! gtcagtctcc 2540 
tcatctgtaa agga^aga^a act?gAtgacg tatcaggcat aatccttgat gagagfctt^C: 300(> 
tgojtgccta ctcagtgcca ggogctg^gg ^w-ac^g ecg tgtteaggac agccttggtc 3DS0 
ctgttctccg ggagcogaca ttccaggggg agagaagttt cctgaagact tccab«5Ctgc 3120 
gttc^ctwt ctgctcctgc tcctggcgcc atwtaggag ceagccatgc acgcaagcgt ilBO 
catgcctcca gggctctgac? tgcccagcce ctcaccgcaa ctccacctca gctgcaoaea 22d0 
cecttggcae atcctgaacc tcattfctcufc ga^gacaca caatttttgc tctcfccctgt 3300 
ccaagcctca tcctctgg<?c gecacctc^ct tccagctcao fctcotttagt gcggcca^ta 3360 
^cg<^ecctgc ctaggcatgt cgacct^C^g ggaecctttt ctggctcttc gaggcctctg 3420 
cccaccatcc cdtctttgtt otccatagtc ccttccccct gttctctctc gtttcatctt 3 a BO 
aetggtctgg caasgtcccc ggccttggge gagccagacc tcctcagbgc ctgcacacag 3540 
ctgcccacag ecagagaaafc ccatttaagc agactgcctg catccttctt aacagtgcaa 360D 
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ggcaggcaCt ccctgooaca agagaccctg ttccCtagtft gg^CAgcttt tetCCtCCCC 36£0 
agaacctoct gtctatcccc acccaatgtc tactcacagg catattgggg aaacaggfcca 3720 
ggctctccca ccgtatatgc aagtgtactg gcatcCstCt gfccttCttCC tacecctaca 37*0 
gtagaaacag tgtctgtccc cagctgtgct ctgatcccgg ctcctttcac ctcagagctt 3&40 
ggaaaattga gctgtcccca ctctctcctg c^CCCattca tCCtaccagc agcttttoca 39 DO 
gccacacgca aacatgctct gtaatttcac attttaaacc ttcccttgac ctcacattcc 3360 
tcttc^cca octctgtttc tctgtfccctc tteacagcaa aaaetgttca aaagagttgt 4020 
tgattacfctt c&fcttccact ttctcaccoc cattctctcc tcaattaact ctCCttcattf 4000 
cccat^atgo cattafcgtgg cttttattag agtcaccaac- cttattotac aaaacaaaag 4140 
oaacaaggac tttgacttct cag^flgcact cagctctggt tctfcgaaacs cetfccgtt&c 4200 
tbgctattcc tcctacctca taacaatcte cttcccagco tctactgctg ccttctctga 4260 
gttct-tccca g^tCCtagg etcagatgta gtgtagctca accctgCtac? acaa.aga.atc 4320 
tcctgaaagc ctgtaaaaat gtccat^at gttctgfcgag tgatctacca agaaaataaa 4^00 
aaattttasa aAtcaaatafc ccatgcctgg gcccacacgc aggggCtCtg atttcatcag 4440 
tctggtaggt gggttctgsg catec&cgc : t cactggattt ccggatgatt gtagtatgca 4500 
gcctaggctg ggaaccaetg gcctcagcaa gccagtcatt ctceaggtgt cacagaccct 4560 
Ctaggtgcta atgaccccga aggtctgtet tcagtgcaca octccccctg agctccagat 4620 
ttaggaafccc cactgcacac gagacatctg gatgtggaa* agac&tCtCC agate ccatg 4 6 B0 
ggtgaaaggg ggttggggga atggagaotc gtgttcttcc aggatgtgtg tggacacaga 474 0 
atgcaaftgf£e tggagggatg etagagecat ag^aa^^aag attt^rggctc act tat teat 4 BOD 
gcaagcactt cctgatg$$t aaggtcttag agcaagctga ggecaa^agg c^SC^Steg 496D 
aggtgetget gcsggcaccc ccactcceta tfagtggcaag cccaagccca gcccttggca 4520 
gctcaaatcc c*33acaegtf tgaaggtcac ccagagagtc a^gggeatgg ctagaaccag 4930 
aac-ecaggac tctggggacc ca^rC^tggca tcctttoctt cattacaaat cfcgagctgct 5O40 
ttgtttccta ^ggatttctg tgatattcca aggggactgt g$9&aagaaa gtccttggaa 5100 
accaaoagga egctagagge ctg^cetgga gectcaggag tctcgoccac cagagggege 5160 
tgggtcctts twaggtcoa gttgetaege asggsctgee tgtgetggga ggctccccag 5220 
gggacacaga ccagagcctfc gcaccagccc aaggaatggg agocfcggggt dct<?tetset 5260 
gga^gACtgc cag^a ccoca aggctgccgc ctcttccttt getcatttge tgtttcactt 5340 
tgtcaatcct tcctttcttc gtgtgttcat tcacatccac tgtgtgctg^ ecctggggaa 54D0 
abgttagata agacacatta gctgtgtgtc ttcattgtcc t3acaaagass cacaccctgg 5460 
aaagagcaco gcaga^agfce ccoattcccB catctccctc cacacatgcfa atcfcggagat 5550 
gectttteca catccagatg tctctgstg? tgtgggattc ttaaataaac asacatttca £560 
tacagaatgt ga^ab$ftt^g agatgetatg gggaaaa^ta aagc^g^gg agggectagt 56 dO 
^tgtgatgcg ggtgaggcat ccagg^attg ctgtttcagc tgtgatcagg aaaggccctg S700 
ggaggaggee acatctg&gc agagacctaa ataaagttgg aiiaccrt^fttg ctgagatatc 5760 
tsgagaagtg tttcaagggc cgggcacC^g gqatggtggc tracgectgt aatcccagca S620 
ctttgggagg cca^gcagg tggatcgctg gaggtcagga gttt^agragc agcctgacca 5BB0 
acafcggagaa accccatctc tactaw«c*t ataaaaatta teegggcatg gtggtdcatg 5940 
cctgtagtoc cagct^ctcg ggaggttgag gcagga^aat cacttga^C-g tgggaggcag 6000 
aggttgeage aagecgagat cacaccac-tg cactccagcc tggatgacag agc^agactc 506 0 
cgtctcaaaa aaaaaaaaga aaagaaaaaa gaaaaaaaaa gaaaagtgtt tcaagcaggg 6120 
gaactggcaa gtggagaggo ccfcgaggcag aaatatgett ggcctgctgg agga^at^t^ €1^0 
agtgag^&gg teagggtggc tggagtggag ggagc^A^t^ gt^ggagtca gacccagttt 6240 
attcatattc tgtaggtctt ^^ggact tea gttttatttt gagtgeaata t^agcccACt 6300 
ggaatgctaa aagctgagag tgacatggig ctgtgattct ggctttaaaa atatcacttt G360 
ggctgetteg tgaagactct ggaaggggca agggtgaaag cagggotgee egttaggaga 6420 
cc?gttacagg ggogcaggca caaaatggca gtggetggga caatggtggc ageageggtt 64 60 
agatgtgaac «tgtt^a*g^( tggaatttgc agaat&tggg ggags^cae^ agagaaagga 6540 
taacttcatc gtttctgctg aacc^ttgg ataaatgttg gtggcacttc ttgaagtgag 6600 
gaaggagrtta ggaaggtggg aaaggcacaa gtttgaattg ggcc^tgatg gtctgagata 6660 
cctagtacag tggttcccca acctttttgg cagaagggac cgctttcatg ^aagacaatt 6 720 
tttccacaga ctgggggtgg ggtggggatg gl^ttc^ggt ggttcgagtg cagtacattt 6 7BO 
atcattagstc tctttttttt tttttttttt tgagatagag tDtcgctctg tcaccc-acac 6540 
tggagtgeag tggagecate ttggctcacfc acaacefcetg ctgcccaggt tcaagtcatt 6 900 
ctcc(sgcctc agcctctcaa ^tagetggga ttataggcat atgcgccaco acgccoagct 6 960 
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aattbttgt* tttttagtag agaoggggtt bcaccatatt ggccaggatg gtctcgaacb 7020 
ootgacctca agtgatcctc cccogcctca acctCCC*A» gtgctggggt tacaggogtg 7DB0 
amcc*ctefc& ccoggeccat ttatcatbag attctcataa ggaatgagea acctagatcc 7140 
ctcgcatgca cagttcacaa tagggttcac gctootatgg gagtctAAtg ctgccgctgc 7200 
actca^ctto totggcttgc <?gctgctCAC cfctctgctgt gcagcccagt tcofcaacagg 7260 
ccacaaacgg ggagtb^ggg acccctgatc tagtaaacat ctaggca^tig tttfcggataa 7320 
tggagttaga gttcotgggg ^a^gtCA^O ctggccatga sacatgggat goctttgeat 7330 
ataggtggtg ttgaaagcca caggacagta cggggtctca gggggtgagc 1 ataaagagag 7440 
gcgaeateag atggccaagg ccaga-at^ea^ A^*ggatgg gaaggagggg ccagtggggc 7500 
aggggsaagc tgtgaa^cca gggaaaaagg gtgtttcgcg gaaaaggatc aA^cbpgacc 7560 
agtgctgccc ctaggeaggg oaggatgaaa cttaaccacc acggattcca tggDcccatg 752Q 
gcctccaggc ca<?aetgS£ AC cbtgagaaga gagatctcag gggacgggtg eseracaagag 76 GO 
occgcctggc atggcttcaa gagataactg aaggAAaGOa agtggagacg cgataaacag 7740 
acaact<?cet ^gaggaattt tactcfccgag aggagaatta aagggtagta gctggagasg 7300 
gatgtggggt caagagaagg tctttaacga cgagaactct cacg^c^tt bgtgcagaac 78 B0 
agggtgggtg tgatgactgfc Q^&t^gagag gggagaactg cagcgactct gtoctaggag 7920 
gaggtgatgg gccgggacca ccaagogsgt ggagggfcgga cgcecettCC? Ctcacccoga 7560 
^ACC'C^catg tgetcagtgfc cc*5t£CCgcc ggccctagtg cctgggctga acgeggggcc B040 
gggactctga ggacgcctcc caggcgcqca gtecgtctgg ccaaggftgga ^cgg^acggc 8100 
n^cttccgac ggtgegcggg te$gcbc$gg gtbgcaggga catcDggcgt ccgctcctgc B1S0 
cctgttttcc tgccttogca gagcgttgcg caaetofcagc tttaaacgce tfCt£tccccc- 9220 
tcaacttgtc tC^ccGagcc ccbctgAttb acagattctg cagtccccga gggttgcgcc B3B0 
fcacga ta ccg aca ctcgcgg cagcctgcga ggcgagtatg atcgt eeca t=t tt tcggagt 9340 
ageaaaotaa ggttcagaga etactabgtc cciig^tcg^t ctggtttgaa ggtccgcttt B4D0 
cctctccctc cgccagcggg cggtgcgagg gactgggcga ggcagogctt oectaagga.^ 9460 
gegacccgca gccccggccc ccteecgact eagccce^tt gcagggcccg ggtcggcgag B520 
gcctcfccagc bctaag^eeg acgggacttg gtgattgggc aggacggaag agctgggtgg BSSO 
ggctttccac cagoggagaa agtctagtgg gcgtggtege gaegag^ocj tg^C^bggbg 9640 
cCCcgcceec gtcc>gcgcg!C teaa&atgsa gggtggcbgt gggggogggg teagaacact B7DD 
ggcggccgat cccaacgagg ctccctggag ccegacgcag agcagcjgcM tggecg$£rCC 9760 
aageaggtat cgacgacegc gcqg^gc^rtC ttgggctgga ccaggogggc gcccggggcc BB2B 
tgctgaggac cacaaagggc actgggggtc gtggtccagg ctgtgcttOC tCC*cgct$0c 99S0 
cetggcccct gectccgccc cegpcccc^c cttcctgccg ctaagcoggc tgoggcgggg B94D 
cogatt-ggcg cctgccggct tcctgcgccg gggccagtct aatgcatggg gcwgg^cg^ 9000 
gggactaagg ggaaactgag tcac?gtc^5t ^tgggagcag ttctgtgtgg gaggcaccac 9050 
cccccactgg gctcg^ggaa ggatccccct ccaagctatg cttgagggtc ecagocec^a 9120 
tctgtctcca caggggccgc accceactec cjge^ttCWC ttcttca^ca cobaggggtc 9160 
ccgocctggc tccca^cagc ctcgactggt cccggaatgg ctaggaggat ccgctgcage 9240 
cgcctccctc ccctcccctc ccctcccctc cectcccctc OC?C!tCCCCtc ccctcocctc 93O0 
cccctcgc^t ccca^gcccc ogtgtgctcc ctccgctggc tctccgcaca gtgtcagctt 93S0 
acacgcctta tatagtccga gcaggctcca gcogoggc'ct gc'tgciC^rgga cctg^pggcg 9420 
g^gcr^9^g$^ £&$tfO£9Ccc ctjjactcacc oggaccgccc gaggctccag gctggcttgg 9d8D 
gH3^ggccg cgccagttta gtccctcggc ccac£?CCtgcf tt^CAAAS&a CCt Caagccb 9540 
ggAttcag^c acccctcacc gttccagtcc caaggggagg ggggctgctc ctgtctttca 
aaagtgaggt ccgccagcc«a geagoecagg ceag^ctgac aaaatacctg cctcctatgg 96^0 
cbt$9gcgtg ctcaggggct gcccgtgcct gcctggcccc tgtccaaggc tggtatcctg 9720 
agctggcccg gcctgcctgc ctgcccgccc aeeafcgct^ cCACtc«cct tctcttctot 97 SO 
oetcteacfsa gccgtrdAtcA tggatbccbt caaagtagcg ctggaggggc cagcaccttg 9B40 
gggcttccgg ctgcaagggg gcaaggactt caatgtgccc ctc?tGGattt eeogggt^ag 9900 
octaggtttg gggagggggc tcwccagc^ ^tetttcggt gcttaggtct ccagaggg^? 3SS0 
abggfg^ggag fccctaacagg agctggtcag gggccagcag gccaggagat gtctaggtoe L0020 
ggagatgtag tggtacctgc ctgccacaag gaetcecsat goggtggiit^ ctgggaggga 100BO 
gca^CCiaggc btCtdCag^c ctgc^ctgta cccgatgctg ttctcccaag ctcctgtggc 10140 
cacctctgag ggctggaggg aggctcattg tgcaggatgg gagccta^ca bttca^gagg 10200 
tatotaaact b^Agrgbggca atgcttggag ccaggcccca ggcaggacac tgtgactata 10260 
ggatttcact tcagcctcac tgccgcrcag ggaatageaa tcctcatt'CC! gbttbtocag 10320 
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atgagagaag Aftctcatgga gaggtggcgg ggctogctca tcgagtccat ggfcgaagcAg L03«0 
ggattggaat tgaggcaCA^ CAtggogtAC Attttttgt^ g^tagaaggg gtctctcccc 10440 
agcctat^ftA Aggacccaca tccactgtt^c ccatCcagga tgtggtggcc tttgaoacca 105 DO 
agcagaagtg tagga caggs ctccAttctA ggggcttAAe tteagcttcc AA^agcctgc 10560 
cctggtgtgg gtggagctgg aggctggctc ctccctgtag c-agggggatt gccttataag 10$ 20 
cccaagaatg -caigccccac!^ ct^ggatggc- eaacAgtggc tgcggtctgc Agag^t^iuia 10600 
agggctggcc taggcctggc cccctgaacc ccactggtgg gcctctcagc tggtcacc-ag 1074Q 
getgcagctc cagcfcgtatg gtceagttgt gagAcacAAC AAattgcctg tfccagagtgg 10SOO 
gtgaggcCAg cetgtoggct. ggcatctctg actggcctgg gggtcaggag ggggtgggga 10&€0 
cttcctgccc ctatatcege ctgcccegag agtacccacce Aggcgceggg tg^gcAsgc* 1O920 
gctgttgtca ggaagcccaa ggcaagccca gcctggaggg gcccagaggg togtggcctg 103 60 
aggaggggct cfca^CtggAg tctgtctgta ggagctgggc gtgggggtta gggtgggcAg 11O40 
$ccagcagtg ctcttctcag gggtcctttg atggcafctct cctggaaccfc gcccogccag 11100 
cagggtagtg a^geagfcggfc tgccotatga cacaogtccc actacatagc cctcaea.--ag 11160 
ce^tgaaacc tacctgacgt cctgotccct 9£$aA&gtgc tggctfcagtg tgtctgggga 11220 
gcctgA&cct eagtttcttc cctgatggag atgactttca gatatggcct gttgggggca 112 B0 
ctccgggctc cagctccctg gteagcatcv ctggcatgtg ggeggg-3 c e A ctagcfcgate 11340 
ccAgccctgg agttggacct gggcccacet gggtgggtga ggtgggcttfc tctgagttag 114DQ 
gccagccccc tccccctcce ctgaceecag Aatggaggga ggtggg&ggg gcaagggotg 114S0 
gc.t3fc9gg.ee eaggcctggg agatgaggta acgtctggga ct^gggggct gggctgetea 11520 
ggctgactca ccccCAecte afcgcagggtc cagccccctg gctttttccc teettggttc! 115BO 
ctctggectt accctgcccc tggcttgagc ccctccctgc ctctcfcccag ccacccgccc 11640 
agegctgtet tcbgrctetce^tgetgcfectc cccacgctct gaacacccct^catcctctgt 117 DO 
gettcetgcG ctcctcactc tgggaaggga acjccgtcccc gccccccacc ccctctccag 11760 
gagecagcta gcbgCAecCG aagacccoca cctcgggctc agcccacagc tcccaggagc 11620 
CAgccctgtg ggcagggagt ggctgggcca ggtttccctt ctaictgactc accotgacct 11860 
tgagfcaagtc jicttccecte tggggtgtca cttccccata cacagtataa ggggttgatt 11540 
tAgttggatt gaactaaagg tgaggga^tg gctcA^ggfcg: tctcc«sgtg ggctgncccc 12000 
tcagttgggG ecccatgctc agcagaggtg gcccacagtg gtggagcctt agggtcagag 12060 
acacttcctg gctctgcctc ttactagctg ggtgAettga ggcAAgttgt fctaaectctc 12120 
tgtgtACAtt tge&agtgca aaatgggtaa aatcccagat tactccacaa ggttgttgga 121 BO 
agattcagtg tcaatatgta. ^catAgttgg fcgeteaataa AetgaagcA* gtCttettAt 12240 
ttatgcgagtg aggasggggc cgccgagctc tettagcett ct^gacctcct aegcaagcaa 123 00 
gaggtcatgt tgaocccagc tc^CCfctt^t tttcecagtg ctgteaagct etgtgectgg 12360 
ct^ecctgoc etctgacatc tctctgaaac ctcttgcctc ccctctccct gcctcagctc 124 20 
agtctgtgca ctgacccace tgaggagect cctggggcca ctggcagcct qqaccccccc 124 BO 
Agafccccecc cacccagtga aattgtctfcd cagcactgcc tcacaa»*gc ctaottgatg 12540 
cagbgecagg cct^ttgfCCA gatggctggg tggtccctta ggcttggacc cagtcaagct 12600 
gccctgcctg tgttgctggg gctgggctag aggcctggaa gsggtttatc agggfccaccc 12660 
tctcagggco t^ggagatac ccaatcccag acattaaaac tgecagtage ccctctacct 12720 
teaaagecaa gtcctggtcc cttcccdtgg catteftd-^c eAtC^fcaA^t gae* CfcCtcac 12760 
ccgctA^cA geacacgcca ttctccttta ccgaggccca ccgcttcctc aaagtcattc 12H40 
ctgatggtct cagctcatgc tg^tg^e^ge catttetcee Agcctact^t etctactc^t 12900 
tgecaea^gA accsgggact cccagctcaa gagectgaag gattggggtc aggggaaatt 12960 
ggcagtcgag ggcttgggag t^ACJAgeCAt ^ftAtggceta cigsagtceea gctgtCAact 13020 
t*ggteecat teaggcagtg ttcacaggga acegggagat aacagggect gtfccctggct 130 HO 
ctcaaagggt cccagcagac cdCtAtagat ggcecocgac agggtgctgg ^gg^tgA^Ag 1314 0 
gtecataaga gcccccggtg gtttcgggga ggaagctgee ccctgcaC>gg gecagaggge 1330D 
atatotsgta ggtgga^tgg cetgggcagg aggecagcag gagaeteaaa aggeaatggt 11260 
CCtectgAAa caettgggct ttagectgag cgtggctgtt tgtggacatc atagcaattt 1332D 
ctggactgtg ggsgagggtg gtgocggtga atAgataagc atcgtgACtg $ggA*gctca 13380 
ggtgagcaee acctgaggga gagggtctgg cagtgaataa ataagcagtg tgactgggaa 1344 0 
attgtgaagc tcaggrtgAgc gccaocsacct cctgggttgc tttagtgte^ ageagctgee 13500 
tagaactatg ttgaatgaag agctctctgg ^ttctgga9>g tgggacagct ttgggtgggg 1356D 
CAgtgtt&cc Aecgtc2agcc tggcttgggt ctgcagggtc cagggcctcg gtcactttge 12620 
ttctctctDc acagctcact cct^ggggcfA A^geggcge<i geceggagtg gccgtgggtg i^^^D 
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actgggtgct ^agcatagat ggcgagaatg^ cgggtagcct cac*c&catc gaagctc^a 13740 
iCAAflateeg ggcctgcggg gagcgcctea, gcctgggcct cagcaggtat gcgggtggac 13B60 
atggatgggt gcgccogcgc tggcagtggg gatccctgcg gcccggc^ct? ctgrtcao^Ct 13960 
ttccttatcc tccagggccc *gccggttc*a gagcaaaccg cagaaggtac gaggctggce 1*92 0 
gggacatccg ^tgcggtgggc ggtgbgggct tggacggcca sgcetgetcg ccctestggc 133&D 
acattctcgg tacctfCAAtc cctggccggg agtggagggc agaaaccgga gctaaggcgg 14Q40 
gtcfcasggce ctggagttga gccaggggct gctgcacggt cctggcacoa cgoatgtecg 1410D 
cetgtctgt:.c cgcctgtctg tccgcctgct gcctcccgcc gcoggcgctg cgtgctcgcc 14150 
cgcaCtc^gt cagc-cctcgg tcctgcgtgg actgagAtCS£ c^frCt^ecaa atgggoacct 1422D 
fcgaaacctga gtcgtcctct ccccgtagcc tccaaataga tgtagggggt sgggtggggg 14280 
tgggg9£Ctg gagctgcogc tgtcctctgc tgcag^cgcc ccaettccac ccaggccocc 14340 
aocttaccct gcccgccegc eetgeecggc tgtgtctctg cccaggcctc cgccccicgcc 14400 
gcggAcectc egcggtacac ctfctgcaccc agcgfcct^cc tcaacaagac ggcccggcct 144*0 
ttgggcgccc ccgcccgctg acAgcgeccc gcagcagaat gggtacgtcg gcccCfb^CC 14520 
gcccgC^ccc aegccatcag gcccactgtg gcoccacgcc cgetgcacgc tgctgctcag 145BO 
tctgtgctgc gcCtfCageco ggcsggaaccg tgcggcaogc cccctggcgg C^ggggtggg iaS40 
^Ctgeaggca cagggcccct cccgatg^Ctg tggegcottg cagggcaccg cctggggagg 14700 
ggtctctgaa tgacgccgcg ccccctgctg gcggctggg$ £t-taggttgt ggtgtcgggc 14760 
cagctgagcc ccagacactc A£t$cogecfc tgtccccggc tgttDtgacc cctccccgtc 14520 
-tttct-teete tcctgtg.tct gfcccctttgt ccctttd.tcfc crtctgtctgt cttatttcct 14B0O 
tcac^aggtgc agaccectgfc c^agtcagtg ageccccctc tgcctgtgcc tttcttcttc 14940 
ctttt^gcac tctgggtggc ggcccctccc cadCCtggct gccctcctct ccacttcgcc 15000 
= ^cct^COTiric"i^aC!'^fe*^e cgcecagcag ggctcct^gc = ctcacccttA^cccactCC!ct 15D60 
ccdAtcactg taacccaaac ccacatgcac caa^tcetgg gaggggctgc ccccaccgcc 1512 0 
cacccccagt gtg^g^ftt^t gagccacacc ctccccacsg acagcCQCtc csgaocgotgg 151SD 
tcccagatgc cagcaagcag cggctgatgg ag^cacaga ggactggcgg ccgcggccgg 1524 0 
ggacaggcca gt-c$cgttC £ o ttccgcatDG ttgcccacct catfAggcAec gagttcagta 1530D 
ngtgccagcc cagggcaggg ggtacfcttCC tcsgcccccag cccaggcgtg atccctgacc? 153S0 
ctgtgtcttt tttggteaat gcctgccfcct gctctctcag tgoaagaccc ggatgaggag 15*20 
cacctgaaga aatcaaggfca cacjgg&cg^rS eaecagcccc tctcccacct cctgCC?tCtt 15^30 
ccattccagC fcactgeectg tgtctactcc tgaggctccc ^gctggggct ctcaafctctc 15540 
cettecttcc ttccttcctt cctt-CCJtfcce ttccctcctc ccttcctfcco ttecttcctt issoo 

GCCtfcCCtCC ttC^ttCDtt CtttCatttG ttCCCt<2<?et CfttCCttCC GtDCtDCCtC 15660 

cetgcctccc ttccatctct ccttC:<?ttca acttcttcct cDctctctct ctgcicectca 15720 
gggemaagtA tgtcetggag ctgcagagca caogctAC^d ccgcctccgg gactggcacc 15780
aecagcgctc tgcccacgtg cto^acgtgc agtcgtagcc cggcccfcct-c cA^cogg^tg 15840
ccctctcfc^e ctcectcttt ctgttcctcc tgccca^^gc acseeccrttag tgcctccagc 15900
ttctgcctac ctcacccccc ctttcgtgco ectggcctga gcctcctgct ^efcctggcec 15960
tggdegwca eotgggttca tctgacactg cottCCCtCt ttgccctgtg gtacbgct^t 16020 
ctgccaggtc tgbgctgcct: tgggeatgga ataaacattc tcagtfcetgc ttgctctgcc 16080
tgtettetat ctttgtggac ctggttfcgca tttggggtgt gggggtgttt cgtggttcgg 16140 
actgtttggg cCctgccgtc* ettgttttca gtgggagggg gt»<=ctggea aaggggccct 16200
gccctgccat cacagatggc ttcctg^cat gaggggagcc ccaggagctg cctoagaA^C? 16260
ggg^^cccfc^ c?ctcgtetcc! cagctagaga cogcaoacca gctaactgga cattgctagg 16320
agaagctgcc cttcccatcc cbacccc^gt gggacctgga atccaactcg gcagtttcca 16380
cgcccccagt c^fttatcecgt ggggccagoa ggacccaggt tggggggtgg ggccatgtca 16440
ggaagctcag ccatgcaggg cctt5*atgg cagatcttgc agDcaggt-gc ccaggacAg^ 16500
agccccagcc ccagcctcat ctacacccca ggagc^cctgg cctggtgaga gggagtgggc 16560
tcggscctgg gcaagggtgs gcageetoca ggggcatggg ggtg^tggg^ ttCt-CtC^ge 16620
tg*etggggc tccacccccg tcctttgggg tccCtgggca cccctttaga gtcactttcc 16680
ccggcaggcc ctaccgcocc cageectacc agccgcccgc cctggg^t^t ggaccetgeg 16740 
tttgccgagc gctatgcccc ggacaaaacg agc^wgtgc tgacccggca cagccagccg 16800
gccacgccoa cgccgctgcA gagecgcacc tccattgtgc aggcagctgd cggaggggtg 16860
ccaggagggg gcagcaacaa cggcaagact cCC^tgtgte accagtgcca caaggtcatc 16920 
cggtgg^tgg cctgttcctg tccgaccctg gctttcccat cctgcsgccc sgcjcceacct 16980
gtotgcccac ctgtcttgcc tcagctgcgn dtggggggaa taaggattca gttctcagct 17040
gg&gtaggag tagggacctg ggctgggtcc tcccattctt aatcccacgc taactacccc 17100
agcccaccca caac&actgc tagcagcatc fc^cc^tggcg aaatacfccfga agogccaacc 17160
ataggctgaa gctgcacccc tacctttgct gctctctggg caaagagggg cctgocccct 17220 
cccagcgcgt ct^cc-vCtoc ctectgctet ctgtctccct ctgctctea^ a^catacagg 17280
cctggagcca ctccctctgt gcacfcgcccc gtggggccaa gcagcatcaa acacccccca 17340 
gratcagcgt gccggattct agagccttcc taattcgoag gcc-tggacCg ctctcatctc 17400
cgtcagotct tttttttttfc tttttgaaa<; agagtctcac fcgtgfctgccc acgttggcgt 17460
gcsgtggcgc gatctoggct cactgoaacc tctgeetcet gggttcaaga gattctccte? 17520 
cctcagcctc cfcgagtagct gggattacag gcacccgcca ccatgcctgg ctaattttgt 17580
atttttagta gagacggggt tttaccatgt tggccaggct ggtofccaaac tcct CftCCtC 17640
aggtgatctc aggcctgcct tggcctccca aagtgctggg actacaggfcg tgagccactg 17700 
tgccdagccg aetctatcag ctcttgccag gtagaacagg caggccsgca ggacagggca 17760 
gctccagggt ttgcCC&qgg gcggctCfrgc ttttatga^ CtCCA^tC^t cagcccttcc 17820 
tccc^gggtc ctccctgctc taaagctgcc tctcctgtca ccagcagttc agtgtggogg 17880
actggctctg tastgcttcat ggetgccacg gt^aettCCC aagcctgtct tCtatCCtAt 17940 
gtggaaaatg ggga-gaatga actgtccctc ccaaggocfcc ctggtgggbg gtcagtcaac 18000
ctgaaggggg ccaagacccc cacctctctg cgtgtgctec ctctgaacge fcCtcgcctCC 18060 
ctgeaggggc cgctacctgg tggcgctggg ccacgcgtac caccoggagg agtttgtgtg 18120
tagccagtgfc gg^aaggt^e tggaagaggg tggcfctcttt gaggagaagg gcg^catctt 18180
ctgcccacca tgctatgacg tgqgcta.tq.c acccagctgt gceaagtgca agaagaagat 18240
tacaggcgtg agtagggetg gofcgg^gggg aggtggteee aagcetgtea gtggga&cga 18300
gggetgctgg gaaacccaca gtccaggtct ctccccgagt gagcctccgg gtccttacca 18360
gcgt aa taaa ""'t$$§Ct gctg= t a ctggcctc^accctgca £t agtcagga tg ctct t aaoaa 18420
atgaccatgt tcctgcbcag asaccgccca aggctgcaaa, gagoaggagg accaagccag 18480
gagaagcccfc gggc?cctcct gactcccact ttgggctctc cctgccctgg tgaaatgaca 18540
ga&oggccaa cttgacadgc? tga&gctget ctgtctCAt^ c^tcretccrfcc atttetggat 18600
ccagageeag ggctgccagg agtagccaga gagct-ctgtg tggtgatgtt catattagtg 18660
aggtttaccfc tgacC^ogag cagtgggaaa etcaaaataa tggtggctfca tttcteafcet 18720
aaaaacatcc cggggtgggt ggtctgggac tgatctggtg gacccaggct ccgccttgtt 18780
gcttgactgfc fcggcagcace tg<?fctactta ecaofceatgg tgeaagatga cactteagcc 18840
tccgccaaaa tgctcacctt: ccagccagca ggaagfccgga aggagaagaa aggggacaga 18900
gccccatggc gbccf^tcctt agaggatgct gccacetgaa cctctgcttt catcotgttg 18960
gtcagaaccc agtcacatga ccacacccag tggcaacgga ggctgggaaa tatagtcttt 19020
attttgggca ccceitgt^gtc cagcaaaact gggggttcca tcagtcggca agaacgggag 19080
agtggccgat gcsgtggctg atgcttgtat cccagcaott bgggaggtcg a^gtgggcag 19140
atcacetgag gtcaggagtt caagaccagc ctggccaata tggtgaaacc ctgtctctac 19200
taaaaataaa aaaattao^t gggtgt^Ct^ gcgcacctgt cigtwcngct acttgggagg 19260
Ctgaggcagg agaatcgctt gatcttgaga ggtggaggtt gcagtgagcc aagattgtgc 19320
cactgccttc cagcctggga gacRgc?«i*ai aa^aaaaaa aa^aasggcc 19380

^ggeaeggtg gctcacacct gtaatcccag cactttggga ggccgagatg ggcggatcac 13440 
gaggtcagga gattg6jgacc; AtCCt^gcta ac*og^t^a* acCOcatctc tactaaaaafc 19500
acaaaaaaat tggccgggca tggtggagta gtcccagcta ctcgggaggc tgaggcagga 19560
gaatggcgtg aacctgg$Acr gca^fagcttg cagt^a^CC^ a^^tcjgc^ec actgcactco 19620
agcctgggea acsagagogag actcttgtct caaaaagaaa aaaagaaaga gaaatctgcc 19680
tcccagcctt gggctcctgc? ecbaceagcc cacaccwts gtn^ag<;ctc ctctccoAcc 19740
agctcaaagc ccaagttcct tcactgtgac cttgtctgct cctctaaaac aggcaacacc 19800
agac^gtg«9 aagcw?<rcagc5 cagacatggg eagaaascet atttetgts* tctactggct 19860
gtgtgagcag gggctagttg ctctctctgg gcctcactga agagaagggt ggcactatgc 19920 
tasg&ccggc «icggttgcaa ggtagatgta agatggggta eaggtgtfcgt ggagggcaga 19980
aatgcaccat ccgaaggcta catgtccccc: ncaottg*tgt cttgcttggc ccacaatgtt 20040
teattttaaa atcagtagca aacaatttaa aaaaccagaa gatttgcctg catgatgcag 20100
tggctcatgc ctgtaatccc wgdAOfcttgg ^A^gccaagg t^ggaggatt gcttgagccc 20160 
aggagttcaa gaccagcatg ggcaccatag caagacccct gtttctacaa aaaaaaaaaa 20220
attagaaaat tagccaa^tg tg^gtggeatg eac^tgtggt ccc«igctact tgggaggcag 20280
agsgaaagtg agatctcctg cttttfcattb ctttatgtat aatgataggg tcttgctctg 20340 
tfcgcccaggc tgg^gtgcag tggcatgatc aetgcteacft gc^gcctt^® fccteetgggc 20400
tcagaggatc etcccacctc agcctccCaa at^getagga ctagaggtgc ccaccagcat 20460
gctcagcaga tttttaaatc tttttgtaga gat^aggttt fcgctat$ttg cCCA^ctgg 20520 
tctcgaatftc etggcctcga gcgatcctcc CsiCCttggeo tcccaaagca ctgggattac 20580
aaaccrtgagc cactgcgoec ageagatttc tctttaacac ctagatttca gcctgA$C"ca 20640
ggcaggcatt cctgaatgaa ccagtagtoC tgeteceaga agaagaggtc ctcctccgtg 20700 
tgacacagtc cc^cttggc ccttgcaggg attggatct-g ggdfcccctgg atttaascto 20760
a^jggccatee tcataacagc ctc^caA93Cf tgggattagc ttcccagttc acaagggaag 20820
aaaccaagae ttgagaaggt caaggtctgg ccagaccc^c acatettgga cccteatacc 20880
gcctcgaggc cccatgctge ccte tgcetg ctccagatgt gaatactgct gsccctggct 20940 
ggccccggct ggeoccgagg gtectaggga tga scarce agcecagggs gagctcagcc 21000
ccttgtgcct ctgcccctte eeacetcetg cggaggccag tcgactcacc cacaaagggc 21060
caggcactgt ggggatagat cagctaacaa aacsgttgafc gcttcctgcc cttctgggcc 21120
ttarattttg gctggc^gaa gagggg3gag gcagactgta agcantii^gc gcrtAt^agta 21180
ggttgcctgg aagtaatgtt agatcacgtt ae§gaaaaca ggaaagagca gagcgacaag 21240 
tgctggggtg cgtg^tgcag ggaaggcagc tggctgctgc tggtgtggtc agagtgggcc 21300
ctcatggaga agactgcatt cgagca9**A cttgaagggg gtgaggggtg agcctagaga 21360
tatctggggc ag^gcagtcc aggcagaggg gacagccggt gbCA-ageeca ggaeaggagt 21420 
gtgC^tggtg tgccsgtttc aggtf^gagg ccagtgtgca gaggcaaggt gagaacgc-aa 21480
^ggagagcag tggeggagac gggtgggaac gaggfcc:*gae! etgctggcct ccagcctctg 21540 
catggggctt ggcte-ttgOt ^ggagcaatg ggaagcagta cacagtttca tgeaggggga 21600
gaaggcctgt ettgggttgc aggggcacgc tgt^eagct gggatcagag agaggagctt 21660
ptaggccagt tgttatgtgg to*cfc*acgggc cagatggcca tggcttacct caefcteaggg 21720 
sggctgtgag a^gcactcsg aatctggatg =t-gc £ Ctt.gggg=gtgggcccca ctggatttco 21780
tgatggacct ggtgt-gg^gl' gtgagaggag ggtgtgtttg gctgcfrgcag aeaggagaat 21840
ggagttgcci t^cgogtgat ggggatggct ^tgg^ggag aggtttgggg tgagggaatc 21900 
aggaactgag tgctggac^t ggeaagtctg aaggcgcagt ggtc^t^Af; fceagagacct 21960 
tggagttgga. gatggaggtg tgggagtcct goacagttag atgtagtgtt taccgcgaga 22020 
aggaacaggg cttgcggcc* gtfeetcetgt gttcccgtga cccagggcsg gg rt! *gg a gsg 22080
gcct-gagcct gccgagtgac tgggacctcc ttceaggaga tcatgcaogc cctgaagatg 22140
acctggcacg tgcactgctt tacctgtgct gcctgcaaga eg reseat ceg gaacagggee 22200 
ttcta^atgg aggagggegt gewtattge gagegaggta cccactggcc aF?tg*gggtg 22260
aggagggatg gtgcatgggg caggcatgaa tcc^ggtc^t ctttctctct gcccccattc 22320 
teagactatg agaagatgtt t-g^«ac*gaaa tgccatggct gtgacttwo gAtcsgacgct 22380
ggg^accget tCf:tggsggc cctgggcttc ngctggeatg acacctgctt cgtctgtgcg 22440
$tgagagccc ogcccctcga Adtgagcccc aagcccaccg gccctct^tt. esttccccag 22500
gagatgeagg ^gaagttggg aaggggcetc t-cctgcitgcc cccaacccca tgt.gvtctgg^ 22560
rtCFtttgctgt ccttagatat gl.aagatcaa cctggaagga aag^cCtt^t. accccaagaa 22620 
ggacaggoct fitcrtgeaaga gccatgcctt ctfiUc?atgtg tgagcccctt ctgcccacog 22680
Ctgccgcggt ggccccteiRV ctgaggggee tggagtcgtg gcpOtg^tU tctgggtagg 22740 
gctggcaatg $ttgccttaa ccctggctw tgg^ccgagc ctggggctcc ctgggccctg 22800
ccccacccac cttatcptc^i caccccactc cctccaccac cacageacac: egatgetgge 22860
^acacfoagcc ccctttcaco tcc^gtg cca caataaacct gtaccc^gct ^tgtcttgtg 22920 
tgcccttccc ctgtg^atcc ggaggggcag aatttg^ggc acgtggcagg gtggagagta 22980 
Agatggtttt cttgggctgg ccoitctgggt ggtcctcgtg atgoag^c^t ggegggctea 23040
tggttagtgg ^gg^ggtaca ggcgagaccc catgtgccag gcccggtgcc cacagacatg 23100 
, ? ^gggagcca ctggtctsgo ctggctt.gga ggttagagaa gggtagttau gw^ggtiagt 23160
tagcatggtg gctcat.gcct gtgatcccag cactttgH*- 21 ggccaaggtg ggcagatege 23220 
ttgaggtcag gagttcgaga cct^tggcc aacacggtga aaoagegtet ^gtaaaaa 23280
tacaaaaatt agccgagtgt ggtggggcat. gcct.gt^^^c? ocagccactc aggaggctga 23340 
ggegggaaaa tcscttgaac ctg^gaagtg gaggttgcag tgagctgagc* tvA^accaet 23400
gcgcgcgagc ctgggt.ggca gatggcagag cgagQcacUQ cttcaaaaaa aaaaaaaaaa 23460 
oaaaaaaaaa gaagggtagt tgt^gttggg ggtggatctg cagagatatg gtgt^gaaaa 23520 
cagcaatggc ^A^agcaaag tcctggaggg gccA$c?tgccr gtccaaacag aagaaggca^ 23580
ggctggagag ggtagcccfct aggtcctggg aagcc<tcgag tgpd^ggeag tagagetggg 23640 
gctgtctt?tt gaggttaggg cagggD?i!*9g eaeagcagag ttcgaaatag gttt^tgttg 23700 
tattgeagaa aag-aggectfC! agaacactga gggagtgct^g g^gggaggct gggaggagvja 23760
gttgcagcag ggccta^g^ og^gggccag gcaagggagg ggeagagagt aatatggcag 23820
agatgggacc cagtggcagg tccgggggat gagggatgga gagaaggac«a ggagcsgttgc 23880
caggcatctg gcctatAOCA gAceitgctca cgctgtctcc cgcgaaccfcc ctagcaacct 23940
tgcgecgttg tctgcaatca cttattteat tttttctttt ttaactttaa ttttttttgt 24000
ttttaagaga caggatctcc ctaggttgcc ojijgctgott tc«aactcct ggge-tcaagc 24060
aattcttcCt ecttagcccc aaagtgctgg eattacaggt gtgagcca.cc atgcctggca 24120 
caettatttt ctagat$A^9 tfacagaaaga tt@ggagact> tcuiCCaaggt cacgctgtca 24180
ttgagcc^fc-g agecagacta gaatccaggc ctgsagctgg gtgcgctgtc ccaggaetgg 24240
ctggcactga gtaceatttg ceagcsagCA tctctetsgg ^gcfcgsett ct^cccggta 24300 
c-ct$gagei*c tgtagacctt ggtggtggcg cogtcactct ggggcttcct gcctcccact 24360 
gatgcccgca ccacCCtagc* ggemetgtca tcteteetgt Cw^gCtftci gactggaaag 24420 
aofc^aa^Aga agccttaagt aggocaggac agctcagtgt gccatggctg cccgtce ttc 24480
agtesgcccct gsicatgixgSA ectgc&atfAC afcct-gtfc&gt cttetc^acft ggcccttggc 24540 
ccg^tcccet ttaagagaog agaagggctg ggcacggt-ga ctc-acacctc taatcccagc 24600
sctttgsaag getgaggctg gagaaggget ccagcttagg agttcaggac c*agoetgg$e 24660
aacatggtga gaccctgttt tgttttgttt tttgtttttt. tgagatggag tcttgctctg 24720
tcgcccaggc tggagtgcag 24740 